medvir / VirMet

Set of tools for viral metagenomics.
14 stars 5 forks source link

Version of VirMet and virus reference database #26

Closed mihuber closed 6 years ago

mihuber commented 7 years ago

It would be good if one could easily retrieve the version of MinVar and the last update date of the virus reference database (e.g. with a --version subcommand).

This would be important for

citation in publications follow-up of changes in virus taxonomy

ozagordi commented 6 years ago

It is already possible to have information on the viral database like

timavo:~ blastdbcmd -db /data/virmet_databases/viral_nuccore/viral_db -info
Database: Viral database indexed 2017-04-18
    57,481 sequences; 1,194,074,656 total bases

Date: Apr 18, 2017  7:58 AM Longest sequence: 2,473,870 bases

Volumes:
    /data/virmet_databases/viral_nuccore/viral_db

What else? Where should this information and the version go?

mihuber commented 6 years ago

Ok, but how do we know, with which version of the database a specific run/sample was analysed? Is the info on the database in a log file? Or do you keep track when the database is updated, so one could connect sample analysis date with the version of the database?

ozagordi commented 6 years ago

Would it be enough to write it in a file like run_info.txt?

mihuber commented 6 years ago

I would make it on the sample level, because somebody could re-run only one sample of a run with an updated database. You could just write blastdbcmd -db /data/virmet_databases/viral_nuccore/viral_db -info into sample_info.txt for every sample.