biocore / metagenomics_pooling_notebook

Jupyter notebooks to assist with sample processing
MIT License

Add support for an optional file containing sequence counts. #195

Open charles-cowart opened 2 months ago

charles-cowart commented 2 months ago

It would speed things up considerably within SPP if seqpro and counts.py could accept a file containing sequence counts for a directory of fastq files. Instead of counting the sequences in every file sequentially with seqtk, SPP could then count them in parallel as the files are created. The file should be optional: if it isn't provided, the current behavior should remain the default.
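
A rough sketch of how the optional file could be consumed, to make the request concrete. Everything here is an assumption for illustration: the function names (`sequence_counts`, `load_counts_file`, `count_fastq_records`), the counts-file layout (one tab-separated `filename<TAB>count` pair per line), and the per-file fallback are all hypothetical, and the pure-Python line counter is only a stand-in for the existing seqtk-based counting, not the current seqpro or counts.py code.

```python
import gzip
from pathlib import Path
from typing import Dict, Optional


def count_fastq_records(path: Path) -> int:
    """Fallback counter (stand-in for the current seqtk-based count).

    Assumes standard 4-line FASTQ records, optionally gzip-compressed.
    """
    opener = gzip.open if path.suffix == ".gz" else open
    with opener(path, "rt") as handle:
        n_lines = sum(1 for _ in handle)
    return n_lines // 4


def load_counts_file(counts_path: Path) -> Dict[str, int]:
    """Read the hypothetical counts file: 'filename<TAB>count' per line."""
    counts: Dict[str, int] = {}
    with open(counts_path) as handle:
        for line in handle:
            line = line.strip()
            if not line or line.startswith("#"):
                continue  # skip blank lines and comments
            name, value = line.split("\t")
            counts[name] = int(value)
    return counts


def sequence_counts(fastq_dir: Path,
                    counts_path: Optional[Path] = None) -> Dict[str, int]:
    """Return {fastq filename: sequence count} for a directory.

    With no counts file, every file is counted directly (current behavior).
    With a counts file, precomputed values are used and any file missing
    from it is still counted directly.
    """
    precomputed = load_counts_file(counts_path) if counts_path else {}
    result: Dict[str, int] = {}
    for fastq in sorted(fastq_dir.glob("*.fastq*")):
        if fastq.name in precomputed:
            result[fastq.name] = precomputed[fastq.name]
        else:
            result[fastq.name] = count_fastq_records(fastq)
    return result
```

Calling `sequence_counts(fastq_dir)` without a second argument keeps the default behavior. Passing a counts file skips the per-file scan for every file listed in it. Whether files missing from the counts file should be counted on the fly or raise an error is a design choice left open here.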