EBI-Metagenomics / EukCC

Tool to estimate genome quality of microbial eukaryotes
GNU General Public License v3.0

error script workflow.py: index out of range #11

Closed Otavio20 closed 3 years ago

Otavio20 commented 3 years ago

Dear all,

I'm facing an issue with the workflow.py script, as shown below: error

It occurs even when GeneMark-ES predicts proteins. Thanks in advance.

openpaul commented 3 years ago

Thanks for reporting this. Given all the 'taxid not found' warnings, I suggest updating the ete3 NCBI taxonomy database.

http://etetoolkit.org/docs/latest/tutorial/tutorial_ncbitaxonomy.html#upgrading-the-local-database
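For reference, a minimal sketch of that update from Python, assuming ete3 is installed and the machine can reach the NCBI servers. The function name `refresh_ncbi_taxonomy` is only an illustration and is not part of EukCC or ete3; the two ete3 calls are the ones described in the tutorial linked above.

```python
def refresh_ncbi_taxonomy():
    """Re-download the NCBI taxonomy dump and rebuild ete3's local database.

    Hypothetical helper for illustration; not part of EukCC or ete3.
    """
    # Imported here so that defining the helper does not require ete3.
    from ete3 import NCBITaxa

    # Opens the local taxonomy database (ete3 creates it on first use).
    ncbi = NCBITaxa()
    # Fetches the current taxdump from NCBI and rebuilds the local copy.
    ncbi.update_taxonomy_database()
    return ncbi

# Usage (needs network access and can take several minutes):
#   ncbi = refresh_ncbi_taxonomy()
```

After the update finishes, rerunning EukCC should no longer print the 'taxid not found' warnings, provided the taxids exist in the current NCBI taxonomy.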

This should fix the issue. I am also working on a fix, but I first need to run some tests before creating a new version (see commit: 425f3cb1ff184364cc3f19e7208bd8ba31a5cf2d). If you are familiar with Python, feel free to build the package from the dev branch and test it.

The error occurs because ete3 cannot compute a lineage for any node, which leaves EukCC without any information to infer the LCA (lowest common ancestor) for your MAG.
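To illustrate the idea, here is a rough sketch of inferring an LCA from lineages. It is not EukCC's actual code: the function name and the root-first list layout are assumptions made for the example. It shows why an empty set of lineages leaves nothing to work with, and how a guard avoids indexing into an empty list.

```python
def lowest_common_ancestor(lineages):
    """Return the deepest taxid shared by all lineages, or None.

    Each lineage is assumed to be a list of taxids ordered from the
    root down to the node. Illustrative only; not EukCC's implementation.
    """
    if not lineages:
        # No lineage could be computed, so there is nothing to infer from.
        # Indexing into the empty list here would raise an IndexError.
        return None

    lca = None
    # Walk all lineages in parallel, one rank at a time, from the root down.
    for taxids_at_rank in zip(*lineages):
        if all(t == taxids_at_rank[0] for t in taxids_at_rank):
            lca = taxids_at_rank[0]
        else:
            break
    return lca


# Two example lineages that diverge below Eukaryota (taxid 2759).
example = [
    [1, 131567, 2759, 33154],
    [1, 131567, 2759, 33090],
]
print(lowest_common_ancestor(example))
print(lowest_common_ancestor([]))
```

With an outdated taxonomy database every lookup fails, so the list of lineages ends up empty, which corresponds to the second call above.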

Otavio20 commented 3 years ago

It worked!

Thank you very much!