Closed joaomcteixeira closed 3 years ago
In my opinion, removing all CONECT
statements in pdb_tidy
would be enough.
CONECT
statements need a topology. Some of the tools do some legwork to correct CONECT statements when the atom serial number change, but that's pretty much the only time we look at them. We could do something similar somewhere else (maybe pdb_tidy, but it's growing too much for my liking...) and remove CONECT records that match unknown serial numbers. Alternatively, going a bit more ahead, we could use simple chemistry and establish that a carbon cannot have more than 4 bonds, so we could issue warnings on those statements.
All things considered, my vote is to ignore CONECT statements altogether. Garbage in, garbage out.
I support that - CONECT statements are too tricky.
If people need them then they can use for example PyMol to generate them - but even then there is probably no warranty they are correct.
Following your comments, I conclude the solution is the remove all CONNECT within pdb_tidy
. :+1:
A user suggested:
We have discussed the possibility to remove all CONECT lines directly because assessing this requires knowing the chemical structure of the molecule, as you have said in our discussions. But also we can keep the CONECT related to S-S bonds easily in case of
ATOM
entries, on the other hand.For
HETATM
entries, there are to main problems identified, from https://www.wwpdb.org/documentation/file-format-content/format33/sect10.html:Some questions
@JoaoRodrigues @amjjbonvin @brianjimenez @mtrellet
ATOM
entries?pdb_tidy
handle this or a new tool instead?