soedinglab / hh-suite

Remote protein homology detection suite.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-3019-7
GNU General Public License v3.0
544 stars 134 forks source link

Potential database bug on pdb70_from_mmcif_220313.tar.gz, focusing on 6WOV_C #330

Open ZiyaoLi opened 2 years ago

ZiyaoLi commented 2 years ago

:exclamation: Make to check out our User Guide.

Expected Behavior

to successfully run hhsearch on pdb70_from_mmcif_220313.tar.gz

Current Behavior

hhsearch errored out claiming "sequences in 6WOV_C do not all have the same number of columns".

Steps to Reproduce (for bugs)

Please make sure to execute the reproduction steps.

  1. install hh-suite via conda conda install -c conda-forge -c bioconda hhsuite
  2. run hhsearch -i a.a3m -o a.hhr -d /db/path/pdb70

I checked the downloaded tar.gz's md5sum which is correct.

I tried to untar the file multiple times with the same error.

The a3m file is at https://gist.github.com/ZiyaoLi/0d66b41d6c95aa6a4b60e5205010f57b, generated from mmseqs.

HH-suite Output (for bugs)

Please make sure to post the complete output of the tool you called. Please use gist.github.com.

https://gist.github.com/ZiyaoLi/0d66b41d6c95aa6a4b60e5205010f57b

Context

Providing context helps us come up with a solution and improve our documentation for the future.

Your Environment

Include as many relevant details about the environment you experienced the issue in.

version 3.3.0 (newest conda)