Changes between Version 40 and Version 41 of SOPs/qc_shortReads
- Timestamp:
- 08/30/17 10:42:44 (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
SOPs/qc_shortReads
v40 v41 207 207 208 208 209 * [[https://cutadapt.readthedocs.io/en/stable/|cutadapt]] is a good tool that is designed to find and remove adapters:209 * '''Method 1''': [[https://cutadapt.readthedocs.io/en/stable/|cutadapt]] is a good tool that is designed to find and remove adapters: 210 210 * more options than fastx_clipper, such as specifically trimming 5' or 3' adapters and specifying error rate (allowed mismatches) 211 211 * much more conservative than fastx_clipper. … … 213 213 * a vs b options from [[http://journal.embnet.org/index.php/embnetjournal/article/view/200|EMBnet.journal]] 214 214 [[Image(cutadapt.2.jpeg,500px)]] 215 * sample command: 216 215 * sample command: 217 216 {{{ 218 217 bsub cutadapt -a GATCGGAAGAGCTCGTATGCCGTCTT -o Nanog_noAdapter.fastq Nanog.fastq … … 224 223 225 224 226 * See [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper usage]] (or ''fastx_clipper -h'') for more arguments 227 * sample command: 228 225 * '''Method 2''': [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper]] 226 * Sample command: 229 227 {{{ 230 228 bsub "fastx_clipper -a CTGTAGGCACCATCAAT -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt" # fastq input and output … … 240 238 }}} 241 239 242 243 * If you get the message "Invalid quality score value..." you have the older range of quality scores. 244 * Add the argument -Q 33, such as 245 * fastx_clipper -a CTGTAGGCACCATCAAT -Q 33 -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt 240 * See [[http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_clipper_usage|fastx_clipper usage]] (or ''fastx_clipper -h'') for more arguments 241 * If you get the message "Invalid quality score value..." you have the older range of quality scores. 242 * Add the argument -Q 33, such as 243 * fastx_clipper -a CTGTAGGCACCATCAAT -Q 33 -i s2_sequence.txt -v -l 22 -o s2_sequence_noLinker.txt 244 245 * '''Method 3''': [[https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/|Trim Galore!]] combines cutadapt, FastQC, and the ability to guess what your adapters are 246 * Sample command 247 {{{ 248 # Usage : trim_galore [options] <filename(s)> 249 trim_galore --phred64 --fastqc -o my_trimmed_reads raw_reads/My_reads.fq.gz 250 # Get all options 251 trim_galore --help 252 253 --phred64 ==> Use Illumina 1.5 encoding quality scores 254 --fastqc ==> Run FastQC after trimming 255 -o ==> Print output files in this directory instead of the current directory 256 }}} 246 257 247 258 == Trim reads to a specified length == 248 259 * If we have reads of different lengths (//i.e.// because we clipped out the adapter sequences), we can trim them to have them all be the same length. Use **fastx_trimmer** for that. 249 260 * sample command: 250 251 252 261 {{{ 253 262 bsub "fastx_trimmer -f 1 -l 22 -i s7_sequence_clipped.txt -o s7_sequence_clipped_trimmed.txt" # fastq input and output