magicDGS / ReadTools

A Universal Toolkit for Handling Sequence Data from Different Sequencing Platforms
https://magicdgs.github.io/ReadTools/
MIT License
6 stars 3 forks source link

Implement spark support for our framework #110

Open magicDGS opened 7 years ago

magicDGS commented 7 years ago

Issue for all ideas related with this

We can implement a Spark framework for our walkers/framework. For this, we require:

All these ideas requires their own issue eventually.

magicDGS commented 7 years ago

Update

For implement reads traversal, we require:

magicDGS commented 7 years ago

There is also an implementation for the FASTQ input format in the ADAM project. Concretely, there are implementation for both single-end files and interleaved pair-end for the org.bdgenomics.adam.io.FastqRecordReader. There are limitations about the compression in the interleaved format, and maybe we can use a workaround for the single-end files to support also split pair-end.