debbiemarkslab / EVcouplings

Evolutionary couplings from protein and RNA sequence alignments
http://evcouplings.org
Other
238 stars 76 forks source link

InvalidParameterError: provided annotation file has no annotation information #214

Open sacdallago opened 5 years ago

sacdallago commented 5 years ago
Traceback (most recent call last):
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/utils/pipeline.py", line 389, in execute_wrapped
    outcfg = execute(**config)
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/utils/pipeline.py", line 185, in execute
    outcfg = runner(**incfg)
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/complex/protocol.py", line 576, in run
    return PROTOCOLS[kwargs["protocol"]](**kwargs)
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/complex/protocol.py", line 454, in best_hit
    kwargs["paralog_identity_threshold"]
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/complex/protocol.py", line 427, in _load_monomer_info
    annotation_table = read_species_annotation_table(annotations_file)
  File "/n/groups/marks/software/anaconda_o2/envs/evcouplings_backend_develop/lib/python3.5/site-packages/evcouplings/complex/similarity.py", line 61, in read_species_annotation_table
    annotation_file
evcouplings.utils.config.InvalidParameterError: provided annotation file 7a74f5bd34b11f505e287d1fd767fbf9/align_1/7a74f5bd34b11f505e287d1fd767fbf9_annotation.csv has no annotation information

One job failed with this error: /n/groups/marks/web/backend/runs/7a74f5bd34b11f505e287d1fd767fbf9

aggreen commented 5 years ago

This is not a bug, it's an error message I put in deliberately to flag when the user's annotation file doesn't have any annotation information. I'm guessing either they aligned against a weird database, uploaded their own alignment and didn't provide any annotation information, or they aligned against uniref/uniprot and didn't find any hits. We may want to think of better ways to communicate this to the user, but ultimately their job has failed because their sequences had no species information that could extract.

sacdallago commented 5 years ago

Ok.

Thy aligned against uniref100 using the pipeline (upload of alignments is not yet possible), so most likely the second case (didn't find any hits). I tried to blast one of the two sequences and indeed I got very ridiculous hits. So this is an accepted failure? Think it would qualify as something to add to the FAQ on the website...

config.txt fasta.txt

sacdallago commented 5 years ago

P.S.: Yes, maybe a more explicative error message in the pipeline would be great?