MonomerLibrary / monomers

dictionary of monomers and links
GNU Lesser General Public License v3.0
14 stars 37 forks source link

fix problematic links and modifications #26

Closed GaribMurshudov closed 1 year ago

GaribMurshudov commented 1 year ago

I have changed some links and modifcations: MET-TYR, SF32-CYS --> SF37-CYS, FOR-LYZ. They need to be tested. Link instruction for MET-TYR LINK: RES-NAME-1 MET ATOM-NAME-1 SD RES-NAME-2 TYR ATOM-NAME-2 CE1 CHANGE CHARGE 1 SD 1

keitaroyam commented 1 year ago

I think it's better to keep the name TYRmod3 (rename TYRm1), and wrong references (see above) should be also fixed.

GaribMurshudov commented 1 year ago

Do you think METmod1 also should stay. Current version of acedrg makes TYRm3, METm1 etc. That is the problem with manual addition. There are always errors I have changed TYRm3 to TYRmod3 and tried to fix related issues. We need automatic update. Otherwise the problems will accummulate. We also need to start thinking about atom type dependent links. Now acedrg writes out acedrg atom types. We need to think how to use them. There should be a seperate topic on this matter.

keitaroyam commented 1 year ago

I am working on it but do not think it will be possible so soon. By the way do you have any example PDB for these updated links?

GaribMurshudov commented 1 year ago

I am working on it but do not think it will be possible so soon. By the way do you have any example PDB for these updated links?

Examples where in the list of all possible links (essentially close contacts) calculate by Rob. I will try to find them. It is another issue: we need an example pdb files (at least one, several would be good, if we would have all pdb files it would be even better) for each link. They should stored in the same place (or close by) as the list of link instructions.

keitaroyam commented 1 year ago

OK, maybe we can find all examples after fixing all problematic links.

GaribMurshudov commented 1 year ago

OK, maybe we can find all examples after fixing all problematic links.

Let us do this. I need an automatic update of mon_lib_list.cif. Otherwise every time I add something there will be a new error.

keitaroyam commented 1 year ago

I fixed links related to 4D4 and MS6, as I needed them. I deleted pept-4D4 and 4D4-pept since they can be handled by the alias mechanism.

GaribMurshudov commented 1 year ago

It seems that the number of errors getting smaller.

keitaroyam commented 1 year ago

Should we keep NH1/NH2/NH3 modifications?

GaribMurshudov commented 1 year ago

They are not needed anymore (I think). But after removing them we should test refmac. If they are used they would be used in refmac. I hope I removed all references to these modifications. We also do not need COO (I think)

keitaroyam commented 1 year ago

OK, we should come back later. I updated following links to make test_monlib.TestMonlib happy. mod_DELk-O2 and mod_DEL-O1 will be revisited after sorting out sugars.

02J-pept CYS-CYC HIS_TYR1 DEL-NMH (mod) PJE-010 CYS-PEB NH2 (mod, just fixed id)

It seems several planes were unnecessarily removed in the original modifications (CYCmod1, PEBmod1, TYRmod1).

I used Acedrg 271 and following parts were fixed manually:

MS6-pept sp2_sp2_1 1 S 1 C 2 N 2 CA 0.000 5.0 2

changed to CA-CA, 180

HIS_TYR1 sp2_sp3_2 1 CB 1 CG 1 ND1 2 CB 0.000 0.0 2

changed to const*, period 1

keitaroyam commented 1 year ago

I updated following links together with TRP/TYR/GTP monomers to avoid redundant modifications. CR8-pept GTP-p HIS_TYR2 TRP-TYR pept-CR8

Following links need to be updated, and I will wait for acedrg problems to be fixed. CYS-FAD HIS-FAD1 HIS-FAD2

This branch should be carefully tested before merging to master.

keitaroyam commented 1 year ago

Now all errors in CI are sorted. But there may be an error in chemistry. Later will find example pdbs and test refinement to merge this to master.

GaribMurshudov commented 1 year ago

For some reason I cannot approve. Somebody else wil need to approve. We will need to think about chemistry a bit more. current thought (needs to be clarified, discssed and implemented) 1) count bond orders and make sure that they are consistent 2) check ligands in their nattive environment (in macromolecules). This will take soem time 3) Pattern match local substractures to those from COD

keitaroyam commented 1 year ago

This is because you started this PR. I can approve, but it's too early to do it. Let us do tests for updated links.

keitaroyam commented 1 year ago

It seems FOR-LYZ link does not exist in PDB. Why was it added? Perhaps FOR-LYS was intended?

GaribMurshudov commented 1 year ago

IT could well be. Is there FOR-LYS?

keitaroyam commented 1 year ago

Yes, but only three examples: 1pw1 2rus 3tax (note this is just from LINK records; more links may be found if we add FOR-LYS)

GaribMurshudov commented 1 year ago

Ok. Let us replace FOR-LYZ with FOR-LYS

keitaroyam commented 1 year ago

OK, but this kind of "group" monomers (FOR is a formyl group) are used in several links (and sometimes it bridges two monomers). We would need to define many links in the current mechanism.

GaribMurshudov commented 1 year ago

I do not think we should increase the number of links at the moment. Instead we should work on new mechanisms for links. Now, acedrg writes out acedrg atom types. I was thinking to use these atom types for link definitions. At the moment the idea is not fully ready yet.

keitaroyam commented 1 year ago

Done. I think we can also delete AHT-ALA, as AHT is no longer used. FOR_C-C does not also exist in PDB and perhaps we can remove it, while this may be added for completeness when FOR_C-N was added.

GaribMurshudov commented 1 year ago

Yes. We should remove any link that does not exist in the pdb or is impossible. What was AHT?

keitaroyam commented 1 year ago

It seems AHT was used in 1tmb, and probably replaced by VLT.

GaribMurshudov commented 1 year ago

I see. We even do not have AHT. I agree that we should remove all links referenced to non-existing ligands

keitaroyam commented 1 year ago

We have AHT, but may not need it.

I tested updated links with high resolution PDBs. It seems the most problematic one is SF37-CYS (in 6fpo, 5adu etc). SF3 should also be fixed probably.

Others are more or less ok. A concern is planarity in bridged planes, as in MET-TYR, TRP-TYR, and HIS_TYR2 (in 5jhx and 7coh).

Planarity issue also happens in a monomer: 2gw3 B/CR8 63/C17 and 2oln A/FAD 1001/C8M for example.

Another concern may be sp2_sp2 sigmas, and we may come back later? For example PJE-010's sp2_sp2_1 as in 2q6f and in CYC and PEB.

keitaroyam commented 1 year ago

Updated a test, and found new problems (DEL-OXT not applicable to SUI, ORN-NE removing undefined HE3). Fixed, but not tested yet.

keitaroyam commented 1 year ago

pept-ORN/pept-SUI/SUI-pept all seem to be fine. Fe-S clusters should be handled in another issue.

GaribMurshudov commented 1 year ago

Ok. Let us open another issue