Closed konstin closed 2 years ago
Thanks for reporting this issue. There is clearly a problem with reading the database and parsing the sequence description in the binary asn.1 format created by formatdb. Previously, I have seen that this format has been changing a bit over time and there might be something similar going on here. I will look into the issue as soon as I have time.
I think this is fixed now in commit 314123b8ce5339dd3952e541678e5f848c6b2487.
It was a problem with very long headers in some of the database entries. Tested with the database you used and seems to work now.
Fixed in release 2.1.1 now available.
Please note that one can also use the modern makeblastdb
utility from NCBI, even the recent 2.11.0+ version, if the -blastdb_version 4
option is specified to makeblastdb
. It should be compatible with the databases created by the old formatdb
.
I hope this solves the issue, @konstin.
Yes it worked, thank you!
I tried to use swipe to annotate uniclust30 with SCOP2 sequences and a specific, innocuous looking sequence reliably causes an error.
I'm using uniclust30_2018_10 from http://gwdu111.gwdg.de/~compbiol/uniclust/2018_08/uniclust30_2018_08.tar.gz and the the following minimized fasta file:
Just using the first sequence passes, but using the both sequences, I get the following error. To me, the second sequence looks innocuous, and it also wasn't the longest sequence in the sequences I tested and which worked.
swipe invocation:
log excerpt, with the hits cut out:
Versions