Skip to main content

ensembl/Ensembl Release 112

Demo
Multi-species (300+) 45 GB Apache 2.0 v112 Updated 3 weeks ago
This is a demo page showing what a dataset detail page looks like on Cyanea. The data shown is illustrative.

Gene annotations for 300+ vertebrate genomes from the Ensembl project.

Ensembl Release 112 provides comprehensive gene annotations for over 300 vertebrate genomes. This includes gene models, transcript structures, regulatory features, and comparative genomics data produced by the Ensembl annotation pipeline.

What’s included

  • Gene models with protein-coding, lncRNA, and pseudogene annotations
  • Transcript sequences (cDNA, CDS, protein) for all annotated genes
  • Regulatory features including promoters, enhancers, and CTCF sites
  • Comparative genomics with gene trees and whole-genome alignments

Use cases

Ensembl annotations are the standard reference for RNA-seq alignment, gene expression quantification, and variant annotation. The data is used by virtually every genomics pipeline for mapping reads to gene models.

Files

homo_sapiens/Homo_sapiens.GRCh38.112.gtf.gz 48 MB
homo_sapiens/Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz 880 MB
mus_musculus/Mus_musculus.GRCm39.112.gtf.gz 38 MB
danio_rerio/Danio_rerio.GRCz11.112.gtf.gz 22 MB
README.md 4.8 KB

Formats

FASTA GTF GFF3 MySQL dumps

Tags

gene annotation vertebrate reference transcriptome