encode/ATAC-seq Pipeline
Published DemoEnd-to-end ATAC-seq analysis with QC reports — from FASTQ to peak calls.
A production-grade ATAC-seq analysis pipeline following ENCODE best practices. This space bundles analysis notebooks, reference datasets, and quality control protocols into a single reproducible workspace.
Pipeline steps
- Read QC — adapter trimming, quality filtering with fastp
- Alignment — Bowtie2 alignment to GRCh38 with mitochondrial read filtering
- Post-alignment QC — duplicate marking, fragment size distribution, TSS enrichment
- Peak calling — MACS2 narrow peak calling with IDR for replicate consistency
- Signal tracks — normalized bigWig generation for genome browser visualization
- Differential accessibility — DESeq2-based differential analysis between conditions
Quality metrics
Every run produces a comprehensive HTML QC report including library complexity estimates, TSS enrichment scores, FRiP values, and reproducibility metrics (IDR).
Contents
6 Notebooks
4 Datasets
2 Protocols