Changes between Version 5 and Version 6 of SOPs/SAMBAMqc
- Timestamp:
- 05/31/16 15:48:56 (9 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
SOPs/SAMBAMqc
v5 v6 101 101 bam_stat.py -i myFile.bam 102 102 }}} 103 ==== infer_experiment.py from RseQC package: can be used to check if theRNA-seq reads are stranded. ====103 ==== infer_experiment.py from RseQC package: Check if/how your RNA-seq reads are stranded. ==== 104 104 {{{ 105 105 … … 109 109 110 110 -i INPUT_FILE in SAM or BAM format 111 -r Reference gene model in bed fomat.111 -r Reference gene models in bed format (converted from GTF file). 112 112 113 # sample output on strand-specific PE reads: 113 --library-type=fr-unstranded 114 --library-type=fr-firststrand 115 --library-type=fr-secondstrand 116 117 # sample output on strand-specific PE reads (since the first fraction is much larger than the second fraction): 118 This is PairEnd Data 119 Fraction of reads explained by "1++,1--,2+-,2-+": 0.9807 120 Fraction of reads explained by "1+-,1-+,2++,2--": 0.0193 121 Fraction of reads explained by other combinations: 0.0000 122 # For gene counting with htseq-count, use --stranded=yes; mapping with TopHat should have been performed with --library-type=fr-secondstrand. 123 124 # sample output on strand-specific PE reads (since the second fraction is much larger than the first fraction): 114 125 This is PairEnd Data 115 126 Fraction of reads explained by "1++,1--,2+-,2-+": 0.0193 116 127 Fraction of reads explained by "1+-,1-+,2++,2--": 0.9807 117 128 Fraction of reads explained by other combinations: 0.0000 129 # For gene counting with htseq-count, use --stranded=reverse; mapping with TopHat should have been performed with --library-type=fr-firststrand. 118 130 119 # sample output on non-stranded PE reads :131 # sample output on non-stranded PE reads (since both fractions are about the same): 120 132 This is PairEnd Data 121 133 Fraction of reads explained by "1++,1--,2+-,2-+": 0.5103 122 134 Fraction of reads explained by "1+-,1-+,2++,2--": 0.4897 123 135 Fraction of reads explained by other combinations: 0.0000 136 # For gene counting with htseq-count, use --stranded=no; mapping with TopHat should have been performed with --library-type=fr-unstranded. 124 137 125 138 For pair-end RNA-seq, there are two different ways to strand reads: