| | 14 | |
| | 15 | 1 - '''Index the reference genome for '''First Pass'''.''' |
| | 16 | |
| | 17 | To generate genome index files for STAR: |
| | 18 | {{{ |
| | 19 | bsub STAR --runMode genomeGenerate --genomeDir /path/to/GenomeDir --genomeFastaFiles /path/to/genome/fasta1 /path/to/genome/fasta2 --sjdbGTFfile /path/to/GTF/FileName.gtf --sjdbOverhang 100 --runThreadN 8 |
| | 20 | }}} |
| | 21 | |
| | 22 | The parameters included in the above sample command are: |
| | 23 | * '''--sjdbOverhang ''' Specifies the length of the genomic sequence around the annotated junction to be used in constructing the splice junctions database. For short reads (<50) use readLength - 1, otherwise a generic value of 100 will work as well (see manual for more info). |
| | 24 | * '''--sjdbGTFfile <GTF_file.gtf>''' Supplies STAR with a GTF file during the genomeGenerate step. Combined with the --sjdbScore <n> option during mapping, this will bias the alignment toward annotated junctions, and reduces alignment to pseudogenes. |