Changes between Version 5 and Version 6 of SOPs/variant_calling


Ignore:
Timestamp:
12/11/13 16:14:22 (11 years ago)
Author:
gbell
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • SOPs/variant_calling

    v5 v6  
    55* mapping short reads
    66* calling raw variants
    7 * adding filters (really more like 'tagging') and some annotations to variants
    8 * annotating effect(s) of variants on genes
     7* adding filters (really more like 'tagging' to identify raw variants that are really variants and not technical errors) and some annotations to variants
     8* annotating effect(s) of variants on genes (like if they change protein sequence)
    99
    1010== Map short reads ==
     
    3838}}}
    3939
    40 Get uniquely mapping reads (?)
     40Get uniquely mapping reads (Is this recommended or not?)
    4141{{{
    4242samtools view -h A_reads.bt2.bam | grep -v XS:i: | samtools view -bS - > A_reads.bt2.sorted_unique.bam
    4343}}}
    4444
    45 Remove duplicated reads assumed to be PCR artifacts (?)
     45Remove duplicated reads assumed to be PCR artifacts.  This should be done even if duplication rate is low.
    4646{{{
    4747# paired-end reads
     
    5353== Call raw variants with mpileup+bcftools ==
    5454
    55 Call variants (one sample vs. reference) with samtools' mpileup+bcftools (see the [http://samtools.sourceforge.net/mpileup.shtml| samtools' variant calling page] for more details)
     55Call variants (one sample vs. reference) with samtools' mpileup+bcftools (see the [http://samtools.sourceforge.net/mpileup.shtml| samtools' variant calling page] for more details).
     56In our experience, "-B" is needed to disable BAQ computation, which would otherwise remove many obvious variants.
    5657{{{
    57 samtools mpileup -d100000 -uf /nfs/genomes/sgd_2010/bwa/sacCer3.fa A_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| A_reads.bt2.sorted_unique.raw.bcf
     58samtools mpileup -d100000 -Buf /nfs/genomes/sgd_2010/bwa/sacCer3.fa A_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| A_reads.bt2.sorted_unique.raw.bcf
    5859}}}
    5960Call variants (multiple sample vs. reference) using a set of BAM files
    6061{{{
    61 samtools mpileup -d100000 -uf /nfs/genomes/sgd_2010/bwa/sacCer3.fa *_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| ALL_reads.bt2.sorted_unique.raw.bcf
     62samtools mpileup -d100000 -Buf /nfs/genomes/sgd_2010/bwa/sacCer3.fa *_reads.bt2.sorted_unique.bam | bcftools view -bvcg - >| ALL_reads.bt2.sorted_unique.raw.bcf
    6263}}}
    6364
     
    6667This step uses vcf-annotate from the [http://vcftools.sourceforge.net/docs.html VCFtools suite]
    6768
    68 Annotate variants by adding tags ("filters" but all variants are kept) to each variant, using all default filters
     69Annotate variants by adding tags ("filters" but all variants are kept) to each variant, using all default filters.
    6970{{{
    7071bcftools view -L -vcg A_reads.bt2.sorted_unique.raw.bcf | vcf-annotate -f + > A_reads.bt2.sorted_unique.withTags.bcf