Changes between Version 22 and Version 23 of SOPs/rna-seq-diff-expressions
- Timestamp:
- 03/12/15 14:55:29 (10 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
SOPs/rna-seq-diff-expressions
v22 v23 32 32 * Carefully choose the best "mode" to handle reads that don't completely map to exactly one gene 33 33 * Counting can be also performed transcript level but this may be too much information 34 * htseq-count typically uses a SAM file as input, but a BAM fine can be used in place, for example35 * '' samtools view accepted_hits.bam | htseq-count -m intersection-strict --stranded=no -gene_models.gtf >| gene_model.counts.txt''34 * htseq-count can accept SAM or BAM (-f option), for example 35 * ''htseq-count -f bam -m intersection-strict --stranded=no accepted_hits.bam gene_models.gtf >| gene_model.counts.txt'' 36 36 * Note that htseq-count assumes that your reads are strand-specific, so if not, include "--stranded=no" (or half of your reads won't be counted). 37 * Note the "-" before the gtf filename; this indicates that the SAM file is coming from standard input (in this case, piped from samtools). 38 * For paired-end reads the sam file has to be sorted by read name with a command like this: 39 * ''bsub "samtools sort -n -m 5G accepted_hits.bam accepted_hitsSortedByname"'' 37 * Note the "-" can be used for input from standard input (stdin) 38 * For paired-end reads the sam file has to be sorted by read name, or coordinate, eg. ''bsub "samtools sort -n -o accepted_hits.sortedByName.bam -m 5G -O bam -T temp accepted_hits.bam"'' 40 39 * To request a certain amount of memory and a specific node use ''bsub -R "rusage[mem=50000]" -m NodeName '' 41 40 * Remove the rows at the bottom with descriptions like no_feature, ambiguous, etc.