JensUweUlrich / Taxor

Fast and space-efficient taxonomic classification of long reads
BSD 3-Clause "New" or "Revised" License

how to build a db from multiple input files and directories? #14

Open rjsorr opened 1 week ago

rjsorr commented 1 week ago

Hi,

I have multiple separate downloads from genome_updater that I want to keep separate. For example, I have one genome_updater directory each for Archaea, Bacteria and Fungi. Now I want to build a single Taxor database from all three together, but how do I do this? Can I point to all three by simply using "*" in the path, or can I, as with Kraken, add them one after the other?

regards

JensUweUlrich commented 4 days ago

@rjsorr Taxor currently cannot deal with different directories of genome_updater. I'm actually thinking about Taxor accepting comma-separated lists of input directories (for the reference genome fasta files) and input files (for the taxonomic information). Is this something that would help you?

ulrikhorn commented 2 days ago

Hello, I am interested in the same thing as @rjsorr, and a comma-separated list of input directories sounds like it would work just fine, as long as it is possible to merge different downloads into a single Taxor database. Thank you!
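
Until something like the comma-separated option discussed above exists, the merge could perhaps be approximated outside Taxor. The following is only a minimal sketch under stated assumptions, not a method confirmed in this thread: it assumes each download consists of one directory of reference FASTA files plus one line-based taxonomy file, and that symlinking the FASTA files into one directory and concatenating the taxonomy files yields valid input. The function name `merge_downloads`, the output names `files` and `merged_taxonomy.txt`, and all paths are hypothetical; nothing here is part of Taxor or genome_updater.

```python
from pathlib import Path


def merge_downloads(sequence_dirs, taxonomy_files, out_dir):
    """Merge several downloads into one sequence directory and one taxonomy file.

    sequence_dirs and taxonomy_files are comma-separated strings, mirroring
    the input style proposed in this thread (hypothetical, not a Taxor API).
    """
    out_dir = Path(out_dir)
    merged_seq_dir = out_dir / "files"  # hypothetical output layout
    merged_seq_dir.mkdir(parents=True, exist_ok=True)

    # Symlink every file so the original downloads stay separate and untouched.
    for seq_dir in sequence_dirs.split(","):
        for src in sorted(Path(seq_dir.strip()).iterdir()):
            if not src.is_file():
                continue
            dst = merged_seq_dir / src.name
            if dst.exists() or dst.is_symlink():
                raise FileExistsError(f"duplicate file name: {src.name}")
            dst.symlink_to(src.resolve())

    # Concatenate the taxonomy files, skipping empty and repeated lines
    # (this also keeps only the first copy of an identical header line).
    merged_tax = out_dir / "merged_taxonomy.txt"  # hypothetical file name
    seen = set()
    with merged_tax.open("w") as out:
        for tax_file in taxonomy_files.split(","):
            with open(tax_file.strip()) as handle:
                for line in handle:
                    line = line.rstrip("\n")
                    if line and line not in seen:
                        seen.add(line)
                        out.write(line + "\n")

    return merged_seq_dir, merged_tax
```

Called as, for example, `merge_downloads("archaea/files,bacteria/files", "archaea/tax.txt,bacteria/tax.txt", "merged")` (placeholder paths), it returns the merged sequence directory and taxonomy file. Whether Taxor accepts the result depends on the taxonomy file format it expects, which this sketch does not check.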