shenwei356 / taxonkit

A Practical and Efficient NCBI Taxonomy Toolkit, also supports creating NCBI-style taxdump files for custom taxonomies like GTDB/ICTV
https://bioinf.shenwei.me/taxonkit
MIT License
357 stars 29 forks source link

modification of the dmp files? #71

Closed nick-youngblut closed 1 year ago

nick-youngblut commented 1 year ago

My nextflow pipeline includes downloading and uncompressing the taxdump files. My pipeline always re-runs this step, even when using caching in nextflow (-resume). I'm wondering if the issue is due to taxonkit modifying the *.dmp files upon the first usage of taxonkit. I'm specifically running taxonkit name2taxid

shenwei356 commented 1 year ago

It does not change them.

nick-youngblut commented 1 year ago

Thanks for the quick response! I'm guessing that it is not practical to change the functionality, so I'll just pre-download the taxdump prior to the pipeline run.