Closed iferres closed 7 months ago
Thank you for using MetaCerberus and your suggestions.
For your suggestions which we find very helpful. Can I asked some small clarifications.
No apologies needed. We really appreciate your suggestions and thoughts. We also think it has great potential and user friendly is what we really strive for. Lets us know your thoughts? We welcome suggestions and ways to make our tools better.
many thanks, RAW
--help
would help. Regarding the output, a small description of the output subdirectories and about you consider are the most important files.metacerberus.py ... --db_dir /path/to/dbs --list_dbs
, and check in the default location if user don't provide the --db_dir
argument. What do you think? (Please, name the arguments as you wish, I just invent some to illustrate the point) metacerberus.py ... --hmm PHROG,VOG,KOFam_prokaryote --prioritize_hmm_order
, then the tool will return PHROG annotation for those which have a hit for this DB, and then VOG for those which don't have a PHORG hit but a VOG, and so on.. My use-case scenario is that I'm interested in phages, and for my very particular case PHROG is the best DB. Thanks for considering my suggestions and for the tool. You can close this issue, don't need to report back :) Bests!
We are working a new version 1.2 based on your suggestions.
Thank you for using MetaCerberus! RAW lab
Hi, thank you for this awesome tool!
I have a couple of suggestions which may improve the (UX of the) software.
1) DB installation path: The
--setup
command doesn't allow to set a custom path to store the DB. Of course, in that case one should also be able to indicate metacerberus where to find it (new CLI option). It is sometime useful to dockerize the applications, and by having the database outside the container result in a way smaller image size. Multiple instances of the same images could use a single DB installed in a shared disk (think in a HPC environment).2) Documentation: It's not well documented that users can opt to search more than one DB at once by passing them as comma-separated arguments (i.e.
metacerberus.py ... --hmm VOG,PHROG
), I had to guess it. Also it would be nice to have documentation about the output files.3) A
--list_db
command to check available DBs.4) Prioritize DB (more difficult to implement, consider it just a comment): I have noticed that the best hit column may not select the best annotated DB (which is a subjective characteristic). For instance, using the above case, I would prioritize PHROG over VOG, which annotation is better curated. But in some cases where both DBs got hits, the best is kept:
In the above case (I'm looking at
step_10-visualizeData/Protein_phanotate/annotation_summary.tsv
file), I would prefer PHROG annotation over VOG's since I'm sure I would get more information thanhypothetical protein
. It would be nice to have the possibility to set a DB priority ranking.Sorry if sounds like a pedant review, I just think metacerberus has great potential and I would like it to be more user friendly :)
Regards