Changes between Version 5 and Version 6 of SOPs/variant_calling
- Timestamp:
- 12/11/13 16:14:22 (11 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
SOPs/variant_calling
v5 v6 5 5 * mapping short reads 6 6 * calling raw variants 7 * adding filters (really more like 'tagging' ) and some annotations to variants8 * annotating effect(s) of variants on genes 7 * adding filters (really more like 'tagging' to identify raw variants that are really variants and not technical errors) and some annotations to variants 8 * annotating effect(s) of variants on genes (like if they change protein sequence) 9 9 10 10 == Map short reads == … … 38 38 }}} 39 39 40 Get uniquely mapping reads ( ?)40 Get uniquely mapping reads (Is this recommended or not?) 41 41 {{{ 42 42 samtools view -h A_reads.bt2.bam | grep -v XS:i: | samtools view -bS - > A_reads.bt2.sorted_unique.bam 43 43 }}} 44 44 45 Remove duplicated reads assumed to be PCR artifacts (?)45 Remove duplicated reads assumed to be PCR artifacts. This should be done even if duplication rate is low. 46 46 {{{ 47 47 # paired-end reads … … 53 53 == Call raw variants with mpileup+bcftools == 54 54 55 Call variants (one sample vs. reference) with samtools' mpileup+bcftools (see the [http://samtools.sourceforge.net/mpileup.shtml| samtools' variant calling page] for more details) 55 Call variants (one sample vs. reference) with samtools' mpileup+bcftools (see the [http://samtools.sourceforge.net/mpileup.shtml| samtools' variant calling page] for more details). 56 In our experience, "-B" is needed to disable BAQ computation, which would otherwise remove many obvious variants. 56 57 {{{ 57 samtools mpileup -d100000 - uf /nfs/genomes/sgd_2010/bwa/sacCer3.fa A_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| A_reads.bt2.sorted_unique.raw.bcf58 samtools mpileup -d100000 -Buf /nfs/genomes/sgd_2010/bwa/sacCer3.fa A_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| A_reads.bt2.sorted_unique.raw.bcf 58 59 }}} 59 60 Call variants (multiple sample vs. reference) using a set of BAM files 60 61 {{{ 61 samtools mpileup -d100000 - uf /nfs/genomes/sgd_2010/bwa/sacCer3.fa *_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| ALL_reads.bt2.sorted_unique.raw.bcf62 samtools mpileup -d100000 -Buf /nfs/genomes/sgd_2010/bwa/sacCer3.fa *_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| ALL_reads.bt2.sorted_unique.raw.bcf 62 63 }}} 63 64 … … 66 67 This step uses vcf-annotate from the [http://vcftools.sourceforge.net/docs.html VCFtools suite] 67 68 68 Annotate variants by adding tags ("filters" but all variants are kept) to each variant, using all default filters 69 Annotate variants by adding tags ("filters" but all variants are kept) to each variant, using all default filters. 69 70 {{{ 70 71 bcftools view -L -vcg A_reads.bt2.sorted_unique.raw.bcf | vcf-annotate -f + > A_reads.bt2.sorted_unique.withTags.bcf