encode/ENCODE4 ATAC-seq
DemoChromatin accessibility across 600+ biosamples from the ENCODE consortium.
The ENCODE4 ATAC-seq collection provides genome-wide chromatin accessibility profiles across 600+ biosamples, including primary tissues, cell lines, and in vitro differentiated cells. This is one of the largest standardized collections of open chromatin data available.
What’s included
- Aligned reads (BAM) processed through the ENCODE ATAC-seq pipeline
- Signal tracks (bigWig) for genome browser visualization
- Peak calls (narrowPeak BED) identifying accessible chromatin regions
- Standardized metadata with biosample ontology terms and quality metrics
Use cases
Used for identifying regulatory elements, transcription factor binding sites, and chromatin state across cell types. Essential for integrative analyses combining gene expression with epigenomic data.
Files
experiments/ENCSR*/alignments.bam
~2.4 TB
experiments/ENCSR*/signal.bigWig
~800 GB
experiments/ENCSR*/peaks.bed.gz
~12 GB
metadata.tsv
14 MB
README.md
5.1 KB