Getting error when running apply_to_db.py

thesamuanels commented 1 year ago

Hi, I am using phylofisher v1.2.9. I have used it once before but I hadn't encounter the following error then.

I am trying to run the command: apply_to_db.py -t 8 -i parasorter_out_Aug.03.2023/ -fi fisher_out_Jul.21.2023/

parasorter_out_Aug.03.2023 includes all parsed tsv files parsed from single gene trees fisher_out_Jul.21.2023 is the output folder from the fisher.py step.

The message I am getting: Traceback (most recent call last): File "/opt/Anaconda3/envs/fisher_1.2.9/bin/apply_to_db.py", line 379, in <module> main() File "/opt/Anaconda3/envs/fisher_1.2.9/bin/apply_to_db.py", line 325, in main new_database(table) File "/opt/Anaconda3/envs/fisher_1.2.9/bin/apply_to_db.py", line 250, in new_database orthologs, paralogs = parse_table(table) File "/opt/Anaconda3/envs/fisher_1.2.9/bin/apply_to_db.py", line 163, in parse_table record = seq_dict[abbrev] KeyError: 'rhaphid2'

"rhaphid2" is the unique ID of a taxon I wanted to add to the database.
I have even tried to exclude rhaphid2 by using the "--to_exclude" flag, but that didn't work either. The unique id looks the same everywhere, and I don't understand why that one in particular should be problematic, especially since there is another one called 'rhaphid1'.

EDIT: I have tried to replace the id with another id in a copied folder, and it doesn't seem like the issue is the specific id name. Additionally, I've just remembered I had issues running forest.py, so back then I used the command: xvfb-run forest.py --local_run -i sgt_out_<DATE>.tar.gz Including the xvfb-run part before the command made it work. I don't know why it didn't work without it, since I have just tried again without, and now it runs smoothly. I wonder if this can be related to the main error in some ways.

Any idea what the issue might be? Thanks