marbl / MetaCompass

MetaCompass: Reference-guided Assembly of Metagenomes
https://github.com/marbl/MetaCompass/wiki
Other
38 stars 11 forks source link

Using MetaCompass with GTDB instead of refseq #15

Closed chassenr closed 4 years ago

chassenr commented 4 years ago

Hi @vcepeda, thanks for this very interesting new tool! I am about to install MetaCompass for the first time, and I have a question about the reference database (refseq), which is downloaded by default in the install.sh script. Is it possible to supply a different reference database, e.g. gtdb? Can the reference database be located outside the install directory (to save disk space by avoiding file duplication if the database is also used for other programs)?

Thanks!

Cheers, Christiane

vcepeda commented 4 years ago

Hi @chassenr, It's not possible to use a different database in the current version. The "refseq" directory doesn't contain the complete archaeal bacterial RefSeq database. It contains the complete and almost complete RefSeq genomes and genes and many additional files that are not part of Refseq, such as the predicted marker genes, clustered marker genes, mash sketches, and taxonomy files. I haven't included the code to generate the database yet, but to make a customize DB you need complete or chromosome level genomes and the predicted genes per genome.

chassenr commented 4 years ago

Hi @vcepeda , thanks for the info. I will use the default database ;) What about the file location? Can I save the database outside of the install directory? On our server, the location for programs is not big enough for the database as we usually have a separate directory for databases. I would like to avoid installing the program somewhere else to maintain a clean file structure. But this is just a question of convenience... Thanks for your feedback.

Cheers, Christiane

vcepeda commented 4 years ago

After running the install directory, I'd suggest to move the refseq directory somewhere else and create a symlink to refseq in the install directory

chassenr commented 4 years ago

Thanks @vcepeda that worked perfectly. Closing the issue now.