Changes between Version 5 and Version 6 of SOPs/SAMBAMqc


Ignore:
Timestamp:
05/31/16 15:48:56 (9 years ago)
Author:
gbell
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • SOPs/SAMBAMqc

    v5 v6  
    101101bam_stat.py -i myFile.bam
    102102}}}
    103 ==== infer_experiment.py from RseQC package: can be used to check if the RNA-seq reads are stranded. ====
     103==== infer_experiment.py from RseQC package: Check if/how your RNA-seq reads are stranded. ====
    104104{{{
    105105
     
    109109
    110110-i INPUT_FILE in SAM or BAM format
    111 -r Reference gene model in bed fomat.
     111-r Reference gene models in bed format (converted from GTF file).
    112112
    113 # sample output on strand-specific PE reads:
     113--library-type=fr-unstranded
     114--library-type=fr-firststrand
     115--library-type=fr-secondstrand
     116
     117# sample output on strand-specific PE reads (since the first fraction is much larger than the second fraction):
     118This is PairEnd Data
     119Fraction of reads explained by "1++,1--,2+-,2-+": 0.9807
     120Fraction of reads explained by "1+-,1-+,2++,2--": 0.0193
     121Fraction of reads explained by other combinations: 0.0000
     122# For gene counting with htseq-count, use --stranded=yes; mapping with TopHat should have been performed with --library-type=fr-secondstrand.
     123
     124# sample output on strand-specific PE reads (since the second fraction is much larger than the first fraction):
    114125This is PairEnd Data
    115126Fraction of reads explained by "1++,1--,2+-,2-+": 0.0193
    116127Fraction of reads explained by "1+-,1-+,2++,2--": 0.9807
    117128Fraction of reads explained by other combinations: 0.0000
     129# For gene counting with htseq-count, use --stranded=reverse; mapping with TopHat should have been performed with --library-type=fr-firststrand.
    118130
    119 # sample output on non-stranded PE reads:
     131# sample output on non-stranded PE reads (since both fractions are about the same):
    120132This is PairEnd Data
    121133Fraction of reads explained by "1++,1--,2+-,2-+": 0.5103
    122134Fraction of reads explained by "1+-,1-+,2++,2--": 0.4897
    123135Fraction of reads explained by other combinations: 0.0000
     136# For gene counting with htseq-count, use --stranded=no; mapping with TopHat should have been performed with --library-type=fr-unstranded.
    124137
    125138For pair-end RNA-seq, there are two different ways to strand reads: