Closed timn2008 closed 3 years ago
Hi timn2008, thank you for your detailed message which enabled me to reproduce the behavior. The line with DSSP ID 215 shows a chain break and a jump from residue 151 to 156. However, the PDB file (cif in my case) contains a residue in this place:
ATOM 2068 N N . LYS B 1 51 ? 21.570 63.818 -25.663 1.00 36.95 ? 151 LYS B N 1
[...]
ATOM 2080 H HZ3 . LYS B 1 51 ? 18.070 57.775 -27.207 1.00 0.00 ? 151 LYS B HZ3 1
ATOM 2081 N N . VAL B 1 52 ? 20.774 61.487 -28.178 1.00 30.37 ? ***152 VAL*** B N 1
[...]
ATOM 2087 H H . VAL B 1 52 ? 20.972 60.732 -27.587 1.00 0.00 ? 152 VAL B H 1
ATOM 2088 N N . GLN B 1 56 ? 25.723 58.089 -33.572 1.00 54.52 ? 156 GLN B N 1
[...]
ATOM 2098 H HE22 . GLN B 1 56 ? 30.936 57.615 -30.566 1.00 0.00 ? 156 GLN B HE22 1
Currently, our residue list is created while parsing the DSSP file and later filled while parsing the PDB file. VAL152 is not found in the DSSP file but in the PDB file and therefore assigned a 'fake DSSP ID' the same as ligands, for example. Later, when getting the end DSSP ID of the SSE, the function returns the highest DSSP ID of all residues of the SSE. This is fine in most cases but results in the misleading 'fake DSSP ID' of 331. That DSSP ignores the last residue before a chain break is a common issue I think. There are several possibilities for us to bypass the problem, e.g., return the DSSP ID of the last residue instead of highest DSSP ID, ignore these residues missing in DSSP as well or, requiring major re-implementations, move away from using DSSP IDs.
Concluding, this is not really wanted but expected behavior. Thank you for reporting this. It helps our discussion about how to treat residues, free amino acids, ligands, residues missing in DSSP and so on (@sonnta). If you want a work around for this issue, you could either of
I hope this helps. If you have any questions regarding our software, feel free to ask. Thanks again for your message.
Best wishes Niclas
I'm running VPLG with coils enabled (plcc_B_include_coils=true in ~/plcc_settings.txt) for PDB 1cgl and in 1cgl.cif_B_albelig_coils_PG.plg I get the following lines
which seems strange due to SSE ending id 331, given that the next SSE starts with 219. The corresponding part of DSSP file is:
Is this an expected behaviour ?