Changes between Version 10 and Version 11 of SOPs/atac_Seq


Ignore:
Timestamp:
08/14/20 09:58:42 (4 years ago)
Author:
gbell
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • SOPs/atac_Seq

    v10 v11  
    66
    77=== Basic Approach ===
    8  * Pre-process reads (remove adapters and other "contamination")
    9  * Map reads to the genome (with an unspliced mapping tool)
     8 * [#preprocess Pre-process reads](remove adapters and other "contamination")
     9 * [#map Map reads to the genome] (with an unspliced mapping tool)
    1010 * Run quality control and calculate QC metrics
    11  * Identify ATAC-seq "peaks" with a high coverage of mapped reads
     11 * [#call_peaks Call ATAC-seq "peaks"] with a high coverage of mapped reads
    1212 * Analyze peak regions for binding motifs
    1313 * Identify differentially accessible regions (for multiple-sample experiments)
    1414 * Identify potential targets of peak regions and/or differentially accessible regions
    1515
     16  * Paired-end reads are better and recommended over single reads for ATAC-seq.  The original [[https://www.nature.com/articles/nmeth.2688 | ATAC-Seq method]] from the Greenleaf lab recommends PE reads, stating, "We found that ATAC-seq paired-end reads produced detailed information about nucleosome packing and positioning."
    1617
    17 === ATAC-Seq Analysis Details ===
     18=== [=#preprocess Pre-process reads] ===
    1819
    19   * Paired-end reads are better and recommended over single reads for ATAC-seq.  The original [[https://www.nature.com/articles/nmeth.2688 | ATAC-Seq method]] from the Greenleaf lab recommends PE reads, stating, "We found that ATAC-seq paired-end reads produced detailed information about nucleosome packing and positioning."
    2020  * Check adapter contamination with fastqc. Remove adapters with cutadapt or trim_galore:
    2121
     
    2727}}}
    2828
    29   * Map reads with bowtie2, for PE keep only concordant pairs
     29=== [=#preprocess Map reads] ===
     30
     31  * Map reads with bowtie2,
     32  * If mapping paired-end reads, keep only concordant pairs
    3033
    3134{{{
    32 # --very-sensitive options --no-discordant    suppress discordant alignments for paired reads
    33 # -X/--maxins <int>  maximum fragment length (500). Increase to 2000 to get a better nucleosome distribution.
     35# --very-sensitive
    3436# --no-discordant    suppress discordant alignments for paired reads
     37# -X/--maxins <int>  maximum fragment length (default=500). Increase to 2000 to get a better nucleosome distribution.
    3538
    3639bowtie2 --very-sensitive --no-discordant -p 2 -X 2000 -x /nfs/genomes/human_hg38_dec13_no_random/bowtie/hg38 -1 read1.fq -2 read2.fq | samtools view -ub - | samtools sort -T $name - >| bowtie_out.bam
    3740}}}
    3841
    39   * Remove duplicates, if needed check fastqc to see deduplication level.  Remove with Picard MarkDuplicates or Samtools rmdup
    40 
     42  * Remove duplicates with Picard MarkDuplicates or Samtools rmdup
     43  * Check deduplication level with 'fastqc'
    4144  * Remove reads mapped to mitochondria
    4245
     
    4548}}}
    4649
    47   * Call peaks using MACS
     50=== [=#call_peaks Call peaks] ===
    4851
     52MACS v2 works well for this.
     53
     54If using the 'BAMPE' option with paired-end reads, let MACS run the pileup and calculate 'extsize'; ask MACS to keep only 1 read if duplicates are present.
    4955{{{
    50 #use this only if using BAMPE, let MACS pileup and calculate extsize; duplicates were not removed
     56
    5157macs2 callpeak -f BAMPE -t file.sorted.bam --broad --keep-dup 1 -B -q 0.01 -g mm -n MACS_ATACSeq_Peaks
    5258
     
    8288
    8389
    84 ==== Tips/Recommendations for ATAC-Seq ====
     90=== Tips/Recommendations for ATAC-Seq ===
    8591//Based on presentation by H.Liu at BioC 2020 presentation //
    8692  *  post-alignment filtering: remove mito/ChrM reads, remove duplciates, remove mapping artifacts: < 38bp and fragments > 2kb, and discordant reads