nf-core / metapep

From metagenomes to epitopes and beyond
https://nf-co.re/metapep
MIT License
8 stars 5 forks source link

Implementation of `csvkit` for stream merging csv #136

Open tillenglert opened 2 weeks ago

tillenglert commented 2 weeks ago

Description of feature

Currently some processes, like MERGE_PREDICTIONS use a custom concat_tsv.py script enabling streamed merging of many cvs, as prediction tables tend to get very large and multiple 100ed files are concatenated.

To reduce maintenance load we could implement an nf-core module for: https://csvkit.readthedocs.io/en/latest/contributing.html#streaming-versus-buffering