rcsb / symmetry

:ferris_wheel: Detect, analyze, and visualize protein symmetry
GNU Lesser General Public License v2.1
26 stars 16 forks source link

SINGLE Refiner incorrect in 2I5I.A #53

Closed lafita closed 9 years ago

lafita commented 9 years ago

The results of the SINGLE refiner on the PDB code 2I5I.A show inconsistent connected residues, that are too close in sequence. This results in a very short alignment and a poor subunit identification (notice, as an example, that brown and yellow residues are part of the same subunit):

2i5i a_bug

This can be considered a bug, because in the alignment graph of the original alignment, before refinement, (left) we can observe that there are at least 10 groups of 7 consistent connected residues, which have to be identified by the refiner without any doubt. Instead, the residues connected in the refined alignment (left) form only 4 groups and they are not consistent, because 63-59, 62-58, etc are too close in sequence that they must be either part of a same subunit or part of none.

2i5i a_bug_graph

A solution might be to check for consistency (a minimum residue distance between connected residues) in the Refinement algorithm, because no code fragment does it. By now only this example has been found to produce non-consistent refined alignments.

lafita commented 9 years ago

I close the issue because it seems a problem with the protein (in the benchmark it is reported as 8-fold), rather than a bug in the refinement/order detection. If any other example of this type is found the issue can be opened again.