| 1 | | None at this time |
| | 1 | These are some useful references: |
| | 2 | |
| | 3 | |
| | 4 | Ref1 for estimates of the number of reads required for single |
| | 5 | nucleotide variant calling: [http://www.ncbi.nlm.nih.gov/pubmed/21771779/] |
| | 6 | |
| | 7 | Ref2 fFor estimates of the number of reads required for single |
| | 8 | nucleotide variant calling: [http://www.ncbi.nlm.nih.gov/pubmed/24434847] |
| | 9 | |
| | 10 | For estimates of the number of reads required for RNA-seq |
| | 11 | and ChIP-seq experiments: [http://www.ncbi.nlm.nih.gov/pubmed/24434847/] |
| | 12 | |
| | 13 | Example 1 |
| | 14 | |
| | 15 | For a 3e+9 nt genome if we want 35x coverage we would need: |
| | 16 | |
| | 17 | 3e+9 * 35 / 40 = 2.625e+09 = 2.6 billion 40-nt reads |
| | 18 | or |
| | 19 | 3e+9 * 35 / 100 = 1.05e+09 = 1 billion 100-nt reads |
| | 20 | |
| | 21 | |
| | 22 | |
| | 23 | Example 2 |
| | 24 | |
| | 25 | For an RNA_seq experiment: |
| | 26 | If we have 6 million paired end reads and a genome with ~7000 genes expressed X 5741 pb average gene length = 40,187,000. That is 40 mill nt to cover. |
| | 27 | |
| | 28 | 6M reads x 70 bp (35 pb per paired end reads) = 420 mill bp that we will cover. |
| | 29 | 420 mill nt that we will cover/ 40 mill nt to cover ~ 10 x coverage. |
| | 30 | |
| | 31 | |
| | 32 | |