Closed rderelle closed 6 months ago
Looking at the code, file "panaroo_runner/annotate.py", I think the issue might be related to the Diamond database (which is either absent, corrupted or in a wrong format). Unfortunately, I can't figure out where it should be located.
Hi Romain, could you send me the commands you used to install ggCaller, please?
Hi Sam,
Here are my installation commands:
conda env create -f environment_linux.yml
conda activate ggc_env
git clone --recursive https://github.com/samhorsfield96/ggCaller && cd ggCaller
python setup.py install
Thanks
A similar issue has come up before I believe, please see issues #16 and #20 and let me know if this solves your problem.
Thanks Sam, it now works!
For those having a similar issue (faulty installation via Conda): _ download databases/models at https://ftp.ebi.ac.uk/pub/databases/pp_dbs/ggCallerdb.tar.bz2 (issue #16 mentioned above by Sam) _ locate your Conda files and (re-)install the databases/models. In my case, the path was: miniconda3/envs/ggc_env/lib/python3.9/site-packages/ggCaller-1.3.4-py3.9-linux-x86_64.egg/models/ggCallerdb
Closing as completed.
Dear Sam,
I'm using ggCaller v1.3.4, which I have installed using Conda (Docker is not available on the Linux cluster I'm using). I tested ggCaller with 10 Mtb genome assemblies I built using the command line:
ggcaller --refs test_input.txt --threads 4
And got the following error message:
I tried again with different genome assemblies but obtained the same error message. Unfortunately, I'm only interested in the gene presence/absence CSV matrix, not the gene annotation that is not performed by default according to the doc (I assume here that Diamond is used to annotate ORFs).
Any thoughts about what could have gone wrong? Happy to give more details if needed.
Thanks, Romain