soedinglab / hh-suite

Remote protein homology detection suite.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3019-7
GNU General Public License v3.0
515 stars 128 forks source link

issue using cstranslate #350

Closed swohlrab closed 1 year ago

swohlrab commented 1 year ago

:exclamation: Make to check out our User Guide.

Expected Behavior

I use the cstranslate command based on a mmseqs search output as described in the mmseq2 manual on p. 75: mmseqs search DBquery DBtarget searchOut tmp -a mmseqs result2msa DBquery DBtarget searchOut searchMsa --msa-format-mode 1 cstranslate -i searchMsa -o searchMsa_cs219 -f -x 0.3 -c 4 -I ca3m -b

Current Behavior

It does not complete due to several errors with the same issue eg: Could not read entry: 20, Message: Sequence 9 has 104 match columns but should have 105!

Does anyone know what the reason for this could be? Could it be an issue that some of the sequences (aa) DBquery and DBtarget are the same?

Thanks for help!