Compress. Connect. Collaborate.
As genomic sequencing expands around the globe, storing the resulting data efficiently and sustainably becomes vital. GA4GH’s CRAM file format for genomic data compression tackles this challenge and helps facilitate global collaboration.
Scroll through the videos below to learn how it has benefited existing users and how you can get involved, too.
Chair of GA4GH, Director of EMBL-EBI
CRAM is a file format that uses various algorithms to compress the data it stores. Some of these algorithms are general-purpose; others exploit the fact that most human genomes are very similar to the reference human genome.
CRAM files store data in columns that are aligned to the reference sequence. This lets users efficiently extract information from the subsets of a file that cover particular chromosomes, which is one of the major use cases of DNA data.
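To make the reference-based idea concrete, here is a toy sketch in Python. It is not the CRAM codec and does not follow the CRAM specification. It only illustrates why similarity to a reference helps: a read that matches the reference can be stored as a position, a length, and a short list of differing bases. The function names `encode_read` and `decode_read` are hypothetical, and the sketch assumes substitutions only (no insertions, deletions, or quality scores).

```python
from typing import List, Tuple

# (offset within the read, base that differs from the reference)
Diff = Tuple[int, str]


def encode_read(reference: str, start: int, read: str) -> Tuple[int, int, List[Diff]]:
    """Encode a read as (start, length, substitutions) relative to the reference."""
    window = reference[start:start + len(read)]
    if len(window) != len(read):
        raise ValueError("read extends past the end of the reference")
    # Keep only the positions where the read disagrees with the reference.
    diffs = [
        (offset, base)
        for offset, (ref_base, base) in enumerate(zip(window, read))
        if base != ref_base
    ]
    return start, len(read), diffs


def decode_read(reference: str, start: int, length: int, diffs: List[Diff]) -> str:
    """Rebuild the original read from the reference plus the stored differences."""
    bases = list(reference[start:start + length])
    for offset, base in diffs:
        bases[offset] = base
    return "".join(bases)


# Hypothetical toy data: the read matches reference[3:11] except for one base.
reference = "ACGTACGTACGTACGT"
read = "TACGAACG"

encoded = encode_read(reference, 3, read)
restored = decode_read(reference, *encoded)
```

In this example the eight-base read reduces to a start position, a length, and a single substitution, and decoding against the same reference restores it exactly. The more closely a genome matches the reference, the shorter the list of differences becomes.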
Wellcome Sanger Institute
The Broad Institute of MIT and Harvard; NIH All of Us Research Program