bluenote-1577 / sylph

ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash.
MIT License

Version used for pre-sketched viral database #15

Closed: simeonhebrew closed this issue 3 months ago

simeonhebrew commented 3 months ago

Hello, thanks for this amazing tool!

I just wanted to confirm which version of the IMG/VR4 database the pre-sketched viral database was built from, and whether you have the corresponding metadata file.

Thank you!

bluenote-1577 commented 3 months ago

Hi @simeonhebrew,

The data should correspond to the latest IMG/VR 4.1 high-confidence vOTUs. In particular, the sequences were dereplicated to vOTUs according to the metadata files from IMG/VR.

You can get the metadata files from https://genome.jgi.doe.gov/portal/IMG_VR/IMG_VR.home.html after creating an account and going to their download page.

simeonhebrew commented 3 months ago

Okay, thanks!