Changes between Version 14 and Version 15 of SOPs/rna-seq-diff-expressions
- Timestamp:
- 01/15/14 14:23:08 (11 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
SOPs/rna-seq-diff-expressions
v14 v15 39 39 * ''bsub "samtools sort -n -m 5G accepted_hits.bam accepted_hitsSortedByname"'' 40 40 * To request a certain amount of memory and a specific node use ''bsub -R "rusage[mem=50000]" -m NodeName '' 41 * ''bsub "samtools sort -n -m 5G accepted_hits.bam accepted_hitsSortedByname"''42 * ''bsub "samtools view accepted_hitsSortedByname.bam | htseq-count -m intersection-strict --stranded=no - gene_models.gtf >| gene_model.counts.txt"''43 41 * Remove the rows at the bottom with descriptions like no_feature, ambiguous, etc. 44 42 * For custom analyses, you can add a pseudocount (such as 1) to all genes in all samples to prevent log2 ratios that require dividing by 0 and reduce background count noise -- BUT be aware that some statistical methods (like DESeq) require raw input values without any pseudocounts or normalization. … … 69 67 * Scaling so total number of counts is equal to mean number of mapped reads across all samples 70 68 * Effective library size (a more complex, but probably more valid method included in programs such as edgeR and DESeq) 71 * As with microarray normalization, be aware of the assumptions of each method and choose the method(s) which are most valid with your experiment 69 * '''Differential expression statistics packages can output a matrix of normalized counts (typically using the method recommended for the accompanying statistics), so typically no additional normalization needs to be done''' (unless one want to perform further normalization, such as using gene length). 70 * As with microarray normalization, be aware of the assumptions of each method and choose the method(s) which are most valid with your experiment. 72 71 * Origin of recommendation for upper-quartile normalization: [[http://www.ncbi.nlm.nih.gov/pubmed/20167110|Bullard et al., 2010]] 73 72 * See [[http://jura.wi.mit.edu/bio/scripts/R/normalize_DGE_matrix.R|normalize_DGE_matrix.R]] for a command-line script for count normalization