
Illumina is committed to delivering innovative sequencing technologies and to helping customers manage growing volumes of next-generation sequencing (NGS) data. Lossless genomic data compression technology from Enancio, formerly known as Lena and now known as original read archive (ORA) compression, combines high compression ratios with fast compression and decompression.
Lossless genomic data compression technology reduces the data storage footprint by up to a factor of five by compressing the output from Illumina sequencing systems. ORA compression technology uses a reference-based compression method: an ultra-fast mapping scheme maps each read onto a reference genome, and only the data needed to regenerate the read is stored, namely a position and a list of differences from the reference.
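To make the reference-based idea concrete, here is a minimal Python sketch. It is not the ORA format or algorithm: the function names (`map_read`, `encode_read`, `decode_read`) are hypothetical, the brute-force search stands in for the ultra-fast mapper, and only substitutions are handled (no insertions, deletions, or unmapped reads).

```python
from typing import List, Tuple

Diff = Tuple[int, str]             # (offset within the read, base seen in the read)
Record = Tuple[int, int, List[Diff]]  # (position, read length, differences)


def map_read(read: str, reference: str) -> int:
    """Brute-force stand-in for a fast mapper: return the reference
    position where the read has the fewest mismatches."""
    best_pos, best_mismatches = 0, len(read) + 1
    for pos in range(len(reference) - len(read) + 1):
        window = reference[pos:pos + len(read)]
        mismatches = sum(a != b for a, b in zip(read, window))
        if mismatches < best_mismatches:
            best_pos, best_mismatches = pos, mismatches
    return best_pos


def encode_read(read: str, reference: str, position: int) -> Record:
    """Store only the position, the length, and the bases that differ."""
    window = reference[position:position + len(read)]
    if len(window) != len(read):
        raise ValueError("read extends past the end of the reference")
    diffs = [(i, base)
             for i, (base, ref_base) in enumerate(zip(read, window))
             if base != ref_base]
    return position, len(read), diffs


def decode_read(record: Record, reference: str) -> str:
    """Regenerate the original read from the reference and the record."""
    position, length, diffs = record
    bases = list(reference[position:position + length])
    for offset, base in diffs:
        bases[offset] = base
    return "".join(bases)


if __name__ == "__main__":
    reference = "ACGTACGTTAGCCGATAGGCTTAACG"
    read = "TAGTCGATAG"  # reference[8:18] with one substituted base
    record = encode_read(read, reference, map_read(read, reference))
    assert decode_read(record, reference) == read  # lossless round trip
    print(record)
```

The record is much smaller than the read whenever the read closely matches the reference, which is the source of the storage savings described above.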
Other data compression technologies are often slow. ORA compression technology is optimized for high compression ratios as well as fast compression and decompression, while preserving data integrity. Quality scores are encoded losslessly using a range encoder and context models adapted to the different types of quality-score schemes.
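The role of a context model can be illustrated with a short Python sketch. This is an assumption-laden illustration, not the ORA encoder: it uses a simple adaptive order-1 model (the previous quality character is the context) and, instead of running a real range encoder, sums the ideal code length of each symbol, which is the size a range encoder driven by the same model would approach. The function name `estimated_bits` and the four-symbol alphabet are hypothetical.

```python
import math
from collections import defaultdict


def estimated_bits(qualities: str, alphabet: str) -> float:
    """Ideal code length, in bits, of a quality string under an adaptive
    order-1 context model. Every symbol must appear in `alphabet`."""
    # One frequency table per context; each count starts at 1 so that
    # no symbol ever has probability zero.
    counts = defaultdict(lambda: {symbol: 1 for symbol in alphabet})
    context = None  # no previous symbol at the start of the string
    total = 0.0
    for symbol in qualities:
        table = counts[context]
        probability = table[symbol] / sum(table.values())
        total += -math.log2(probability)  # cost of coding this symbol
        table[symbol] += 1                # adapt the model after coding
        context = symbol
    return total


if __name__ == "__main__":
    alphabet = "F:,#"  # hypothetical four-level quality scheme
    qualities = "F" * 50 + ":" * 5 + "F" * 45
    fixed_cost = 2 * len(qualities)  # 2 bits per symbol without a model
    print(estimated_bits(qualities, alphabet), "bits vs", fixed_cost)
```

Because quality scores tend to repeat along a read, a model that conditions on the previous score assigns high probability to the next one, and the coded size falls well below a fixed-width encoding.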
All files compressed with ORA compression technology can easily be decompressed using our decompression software. The decompression software is free to download and use.
Download decompression software
Once the decompression software is installed, a single command can pipe the decompressed output on the fly into a wide range of popular mapping tools such as BWA, STAR, and Bowtie. The compression and decompression technology is also integrated within DRAGEN secondary analysis software, which provides accurate, ultra-rapid analysis of sequencing data.
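The on-the-fly piping described above can be sketched in Python with the standard `subprocess` module. The helper name `stream_into` is hypothetical, and the two argument lists are placeholders: the actual decompression command, its option for writing to standard output, and the mapper's option for reading from standard input should be taken from each tool's own documentation.

```python
import subprocess
from typing import Sequence


def stream_into(producer: Sequence[str], consumer: Sequence[str]) -> bytes:
    """Run the equivalent of `producer | consumer` and return the
    consumer's output, without writing an intermediate file."""
    with subprocess.Popen(producer, stdout=subprocess.PIPE) as src:
        with subprocess.Popen(consumer, stdin=src.stdout,
                              stdout=subprocess.PIPE) as dst:
            # Close our copy of the pipe so the producer is signalled
            # if the consumer exits early.
            src.stdout.close()
            output, _ = dst.communicate()
        src.wait()
    if src.returncode != 0 or dst.returncode != 0:
        raise RuntimeError(
            f"pipeline failed: producer={src.returncode}, "
            f"consumer={dst.returncode}")
    return output


# Hypothetical usage; both command lines are placeholders, not documented syntax:
# sam = stream_into(
#     ["<decompressor>", "<write-to-stdout option>", "sample.fastq.ora"],
#     ["<mapper>", "<reference>", "<read-from-stdin option>"],
# )
```

Streaming this way means the uncompressed FASTQ never has to be written to disk, so the storage savings are kept during analysis as well as at rest.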

DRAGEN ORA lossless genomic data compression is now available on-instrument with the NextSeq 1000 and NextSeq 2000 Systems and the NovaSeq X Series, as well as on the DRAGEN secondary analysis server starting with v3.8. Learn more about:
NextSeq 1000/2000 Systems
NovaSeq X Series
DRAGEN secondary analysis