| 1 | == SNP Calling == |
| 2 | === mpileup (SAMtools) === |
| 3 | * Create bcf file (binary version of vcf) from alignment file (bam) |
| 4 | |
| 5 | |
| 6 | {{{ |
| 7 | #Use Q to set base quality threshold |
| 8 | samtools mpileup -Q 25 -ugf <reference.fasta> <file.bam> | bcftools view -bvcg - > accepted_hits.raw.bcf |
| 9 | }}} |
| 10 | |
| 11 | |
| 12 | * Convert bcf to vcf (vairant call format) and filter, if needed |
| 13 | |
| 14 | |
| 15 | {{{ |
| 16 | #filter using varFilter (from vcfutils) |
| 17 | #use Q to set mapping quality |
| 18 | #use d for minimum read depth |
| 19 | bcftools view accepted_hits.raw.bcf | vcfutils.pl varFilter -d 5 -Q 20 >| accepted_hits.raw.vcf |
| 20 | }}} |
| 21 | |
| 22 | |
| 23 | |
| 24 | == Filtering SNPs == |
| 25 | |
| 26 | * Remove, or retrieve, known SNPs |
| 27 | |
| 28 | |
| 29 | {{{ |
| 30 | #To remove known SNPs, use intersectBed (bedtools). Known SNPs can be downloaded from Ensembl or NCBI/UCSC (eg. dbSNP) |
| 31 | intersectBed -a accepted_hits.raw.filtered.vcf -b Mus_musculus.NCBIM37.60.bed -wo > filteredSNPs.vcf |
| 32 | }}} |
| 33 | |
| 34 | |
| 35 | == Annotating SNPs == |
| 36 | * Determining the effect of the SNP |
| 37 | |
| 38 | === snpEff === |
| 39 | |
| 40 | {{{ |
| 41 | #Use appropriate i) organism and ii) annotation (eg. UCSC, RefSeq, Ensembl) |
| 42 | #Mouse: mm37 (UCSC/RefSeq), mm37.61 (Ensembl) |
| 43 | #Human: hg37 (UCSC/RefSeq), hg37.61 (Ensembl) |
| 44 | #Output is created in several files: an html summary file and text files with detailed information |
| 45 | snpEff -vcf4 mm37.61 accepted_hits.raw.filtered.vcf > accepted_hits.raw.filtered.snpEff |
| 46 | |
| 47 | }}} |