Changes between Version 40 and Version 41 of SOPs/qc_shortReads


Ignore:
Timestamp:
08/30/17 10:42:44 (7 years ago)
Author:
gbell
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • SOPs/qc_shortReads

    v40 v41  
    207207
    208208
    209   * [[https://cutadapt.readthedocs.io/en/stable/|cutadapt]] is a good tool that is designed to find and remove adapters:
     209  * '''Method 1''': [[https://cutadapt.readthedocs.io/en/stable/|cutadapt]] is a good tool that is designed to find and remove adapters:
    210210    * more options than fastx_clipper, such as specifically trimming 5' or 3' adapters and specifying error rate (allowed mismatches)
    211211    * much more conservative than fastx_clipper.
     
    213213    * a vs b options from [[http://journal.embnet.org/index.php/embnetjournal/article/view/200|EMBnet.journal]]
    214214       [[Image(cutadapt.2.jpeg,500px)]]
    215     * sample command:
    216 
     215       * sample command:
    217216    {{{
    218217    bsub cutadapt -a GATCGGAAGAGCTCGTATGCCGTCTT -o Nanog_noAdapter.fastq Nanog.fastq
     
    224223
    225224
    226   * See [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper usage]] (or ''fastx_clipper -h'') for more arguments
    227   * sample command:
    228 
     225  * '''Method 2''': [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper]]
     226    * Sample command:
    229227{{{
    230228bsub "fastx_clipper -a CTGTAGGCACCATCAAT -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt"  # fastq input and output
     
    240238}}}
    241239
    242 
    243   * If you get the message "Invalid quality score value..." you have the older range of quality scores.
    244     * Add the argument -Q 33, such as
    245     * fastx_clipper -a CTGTAGGCACCATCAAT -Q 33 -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt
     240      * See [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper usage]] (or ''fastx_clipper -h'') for more arguments
     241      * If you get the message "Invalid quality score value..." you have the older range of quality scores.
     242        * Add the argument -Q 33, such as
     243        * fastx_clipper -a CTGTAGGCACCATCAAT -Q 33 -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt
     244
     245  * '''Method 3''': [[https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/|Trim Galore!]] combines cutadapt, FastQC, and the ability to guess what your adapters are
     246     * Sample command
     247{{{
     248# Usage : trim_galore [options] <filename(s)>
     249trim_galore --phred64 --fastqc  -o my_trimmed_reads raw_reads/My_reads.fq.gz
     250# Get all options
     251trim_galore --help
     252
     253--phred64 ==> Use Illumina 1.5 encoding quality scores
     254--fastqc ==> Run FastQC after trimming
     255-o ==> Print output files in this directory instead of the current directory
     256}}}
    246257
    247258== Trim reads to a specified length ==
    248259   * If we have reads of different lengths (//i.e.// because we clipped out the adapter sequences), we can trim them to have them all be the same length. Use **fastx_trimmer** for that.
    249260   * sample command:
    250 
    251  
    252261{{{
    253262bsub "fastx_trimmer -f 1 -l 22  -i s7_sequence_clipped.txt -o s7_sequence_clipped_trimmed.txt"  # fastq input and output