Open joaomcteixeira opened 2 years ago
That's right. However, I feel we are adding too much to the purpose of merge. All that should be done by tidy
. Maybe if we exchange the -strict
option for tidy
we don't need to touch the merge
. What do you think? Because the merge
was really designed to be a concatenator
Yes - but why does it remove the END statements then? Other code don’t do that (e.g. pdb_chain).
The current version of pdb_merge
does not remove any lines from the input files. It's a straight concatenation.
So a simple cat command effectively.
Meaning pdb_tidy should always be run to correct things.
A very different behaviour than pdb_mkensemble
for example.
Yes. mkensemble
has the clear purpose of making a correct ensemble of structures. While merge
assumes the user knows what he/she is doing when concatenating the files. Likely the user cleaned the PDBs before using merge.
hmmm…. assumptions… this is asking for troubles…
It is this make clear to the users?
Yes. mkensemble has the clear purpose of making a correct ensemble of structures. While merge assumes the user knows what he/she is doing when concatenating the files. Likely the user cleaned the PDBs before using merge.
Good point. Let's first clarify that to users before changing the behavior of pdb_merge
.
I was reviewing this now, and I still agree we should not touch pdb_merge
for this purpose and pipe the results to pdb_tidy
as the logic is not that straightforward. It is stated in the docs.
The contents are not sorted and no lines are deleted (e.g. END, TER
statements) so we recommend piping the results through `pdb_tidy.py`.
How will you explain users that pdb_merge
does require pdb_tidy
to make a correct PDB while pdb_mkensemble
not...
You are right. I am addressing this. It's not as straightforward as adding an END
because also TER
was not accounted for. If TER
s exist in the output is because they were already present in the input. I am working on it.
When doing pdb_merge
, should atom numbers be renumbered starting from 1
? This would change the original atom numbers but avoid repeated numbers.
Probably good indeed
Hi @amjjbonvin
I have addressed this issue, plus some other details. Now, pdb_merge
operates as follows:
REMARK
, ...)Inside tests/data/
there are three PDBs, dummy_merge_A/B/C.pdb
that you can use to test. Test also with others you may know, and let me know.
Closes #149