magicDGS / ReadTools

A Universal Toolkit for Handling Sequence Data from Different Sequencing Platforms
https://magicdgs.github.io/ReadTools/
MIT License

Support other compression algorithms for FASTQ/Distmap data #411

Open magicDGS opened 6 years ago

magicDGS commented 6 years ago

The current implementation only supports multiple compression algorithms on HDFS; for local files and other java.nio.Path implementations it only checks for GZIP and BZIP2. In addition, this is highly inconsistent in several ways:

Maybe the best way would be to add a service loader of our own for the compression algorithms and support the same set of algorithms locally and on HDFS, to avoid nightmares while testing on different file systems.
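
A rough sketch of what such a service loader could look like, using only the JDK. Everything here is an assumption for illustration: `CodecRegistry`, `CompressionCodec`, and `GzipCodec` are hypothetical names, not existing ReadTools classes, and codecs are matched by file extension only. Extra algorithms (for example, a bzip2 codec backed by an external library) would be registered as providers and picked up by `java.util.ServiceLoader`.

```java
import java.io.BufferedInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Optional;
import java.util.ServiceLoader;
import java.util.zip.GZIPInputStream;

/** Hypothetical registry; not part of ReadTools. */
final class CodecRegistry {

    /** Hypothetical service interface that each compression algorithm implements. */
    public interface CompressionCodec {
        /** Lower-case file extension handled by this codec, e.g. ".gz". */
        String extension();

        /** Wraps a raw stream in a decompressing stream. */
        InputStream decompress(InputStream raw) throws IOException;
    }

    /** Built-in GZIP codec backed by java.util.zip. */
    public static final class GzipCodec implements CompressionCodec {
        @Override
        public String extension() {
            return ".gz";
        }

        @Override
        public InputStream decompress(InputStream raw) throws IOException {
            return new GZIPInputStream(raw);
        }
    }

    private final List<CompressionCodec> codecs = new ArrayList<>();

    CodecRegistry() {
        codecs.add(new GzipCodec());
        // Additional providers are discovered from META-INF/services files named
        // after the binary name of the interface (CodecRegistry$CompressionCodec).
        // With no such file on the classpath, this loop adds nothing.
        for (CompressionCodec codec : ServiceLoader.load(CompressionCodec.class)) {
            codecs.add(codec);
        }
    }

    /** Finds the first codec whose extension matches the file name, if any. */
    Optional<CompressionCodec> findCodec(Path path) {
        Path fileName = path.getFileName();
        if (fileName == null) {
            return Optional.empty();
        }
        String name = fileName.toString().toLowerCase(Locale.ROOT);
        return codecs.stream().filter(c -> name.endsWith(c.extension())).findFirst();
    }

    /** Opens the path, decompressing it when a matching codec is registered. */
    InputStream open(Path path) throws IOException {
        InputStream raw = new BufferedInputStream(Files.newInputStream(path));
        Optional<CompressionCodec> codec = findCodec(path);
        if (!codec.isPresent()) {
            return raw;
        }
        try {
            return codec.get().decompress(raw);
        } catch (IOException e) {
            raw.close(); // do not leak the underlying stream on a bad header
            throw e;
        }
    }
}
```

Because `open` goes through `Files.newInputStream`, the same lookup would apply to any file system that exposes a java.nio.Path provider, which is the point of having one registry instead of separate local and HDFS code paths.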

magicDGS commented 6 years ago

I recently downloaded a bzip2-compressed FASTA file, and it would be useful to be able to use it directly in ReadTools. At least that part should be done for version 2.0.0.