Explore 3.6 PB of genomics data across 4.4M files
Data from various genomics file formats (BAM, VCF, BED, etc), and sequencing technologies.
Sequencing data and analysis of 10 trios. First complete human genome assembly.
Reference data from several sequencing technologies. Used as ground truth for benchmarking.
Sequencing data and analysis of >2,500 individuals from around the world.
Sample data used for testing and benchmarking the DeepVariant variant caller.
Sample datasets from the Broad Institute for testing bioinformatics workflows.
Data from the Vertebrate Genomes Project (VGP), featuring reference genomes for vertebrate species.
Microbiome data of 300 healthy adults, and several individuals with disease conditions.
Sequencing datasets and reference genomes of several threatened Australasian species.
Sequencing data and analysis of >3,000 rice varieties from 89 countries.
Test datasets for the GATK variant caller, with data from WGS, WES, and RNA-seq.
Data from the Element Bio manuscript about the Avidity instrument.
Oxford Nanopore benchmarking datasets from various sequencing chemistries and samples.
Analysis of pediatric brain tumors: gene expression, gene fusions, somatic mutations, CNVs, and SVs.
Access a custom public bucket