quixdb / squash

Compression abstraction library and utilities
https://quixdb.github.io/squash/
MIT License

Include genomics dataset #239

Open epruesse opened 6 years ago

epruesse commented 6 years ago

Data compression is of high interest to bioinformatics: DNA sequencing machines can now generate data in the TB/day range. While there are dedicated formats, generic compression codecs are generally used as the base layer.

The human genome is probably a bit large, but perhaps an E. coli genome and its annotation would work as a test case.
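
To illustrate what such a test case would measure, here is a minimal sketch in Python that compares general-purpose codecs on DNA-like input. It uses only the standard library (`zlib`, `bz2`, `lzma`) rather than Squash itself. The helper names `synthetic_dna` and `compression_ratios` are hypothetical, and the input is a random A/C/G/T string, not a real genome, so it lacks the repeats and annotation text that a real dataset would contain.

```python
import bz2
import lzma
import random
import zlib


def synthetic_dna(length: int, seed: int = 0) -> bytes:
    """Return a random A/C/G/T sequence (assumption: a stand-in for real genome data)."""
    rng = random.Random(seed)
    return "".join(rng.choice("ACGT") for _ in range(length)).encode("ascii")


def compression_ratios(data: bytes) -> dict[str, float]:
    """Compress data with each stdlib codec and return original size / compressed size."""
    codecs = {
        "zlib": (zlib.compress, zlib.decompress),
        "bz2": (bz2.compress, bz2.decompress),
        "lzma": (lzma.compress, lzma.decompress),
    }
    ratios = {}
    for name, (compress, decompress) in codecs.items():
        packed = compress(data)
        # Check the round trip before trusting the compressed size.
        if decompress(packed) != data:
            raise ValueError(f"{name} round trip failed")
        ratios[name] = len(data) / len(packed)
    return ratios


if __name__ == "__main__":
    sample = synthetic_dna(100_000)
    for name, ratio in compression_ratios(sample).items():
        print(f"{name}: {ratio:.2f}x")
```

With a real E. coli FASTA file, the same function could be called on the file's bytes instead of the synthetic sample.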