Open dayang-us opened 4 years ago
Very nice work. Odd that the SMILES input didn't seem to work for you. Are the strings working in anything else? Want to post them here? For the molecule suggestion, did you think about just replacing the OH with NH2? Synthetically probably easier, I would guess? Could you also please just clarify (maybe by just adding a figure to the above) what AZ8074 and AZ9 look like?
I've had a look at the eNTRyway website and it seems to be giving an error with any submission at the moment (including some that previously were fine) so it seems to be an issue there.
You can install the eNTRyway tool locally from https://github.com/HergenrotherLab/entry-cli. Just to test this I've run the proposed compound successfully,
> python calc_props.py -s "CN1N=C(NC2=NC(N[C@@H](C(N)O)C3=CC=CC=C3)=NC3=C2C=NN3)C=C1C(C)(C)C"
..tot conformations = 186624
..tot confs tested = 100000
..below energy threshold = 40509
Properties for CN1N=C(NC2=NC(N[C@@H](C(N)O)C3=CC=CC=C3)=NC3=C2C=NN3)C=C1C(C)(C)C
--------------------------
Mol. Wt.: 421.498780
Formula: C21H27N9O
RB: 7
Glob: 0.106897
PBF: 1.080287
primary_amine: True
Regarding the MOE 'Glob' descriptor, my understanding that the eNTRyway tool's globularity is designed to be a freely-available mimic of this.
Very nice work. Odd that the SMILES input didn't seem to work for you. Are the strings working in anything else? Want to post them here? For the molecule suggestion, did you think about just replacing the OH with NH2? Synthetically probably easier, I would guess? Could you also please just clarify (maybe by just adding a figure to the above) what AZ8074 and AZ9 look like?
Hi Prof @mattodd, I've updated the Issue with the chemical structures and SMILES notation. I agree with replacing OH with NH2, I'll look into this for the 2nd iteration of the project! Thank you!
I've had a look at the eNTRyway website and it seems to be giving an error with any submission at the moment (including some that previously were fine) so it seems to be an issue there...
Hi @Andy-Wilkinson this is really great, thank you so much for helping out and linking me to the internal eNTRyway tool! Will definitely make good use of it.
Update:
python calc_props.py -s "CC(C)(C)c1cc(Nc2c(cn[nH]3)c3nc(N[C@H](c3ccccc3)N)n2)nn1C"
..tot conformations = 62208
..tot confs tested = 62208
..below energy threshold = 31362
Properties for CC(C)(C)c1cc(Nc2c(cn[nH]3)c3nc(N[C@H](c3ccccc3)N)n2)nn1C
--------------------------
Mol. Wt.: 391.472800
Formula: C20H25N9
RB: 6
Glob: 0.094911
PBF: 0.992900
primary_amine: True
Tried it with a different molecule and the program worked perfectly, thanks again 👍
@dayang-us I have made an RDKit implementation of vsurf_A. I couldn't find the details of the MOE implementation so I just used what is in RDKit and scaled the values empirically to values that are somewhat similar to those in the Richter paper.
@dayang-us I have made an RDKit implementation of vsurf_A. I couldn't find the details of the MOE implementation so I just used what is in RDKit and scaled the values empirically to values that are somewhat similar to those in the Richter paper.
This is really great, @jhjensen2! Thank you very much for this.
just to mention, Data Warrior has an option to calculate globularity.
In case you have not used, the option is in DataWarrior/Chemistry/From chemical structure/Calculate Properties/3D/ It gives two globilarity calculation options. I copied both down:
Hi all!
Some background info:
For my project I have managed to modify the AZ hit compound, AZ5595 by adding a primary amine to the ethanolamine moiety. I surmised that it's the best place to add a structure as to not interfere with the hydrogen bonds formed with the scaffold, and the t-butyl and phenyl ethanolamine moieties yielded the submicromolar potency against Pseudomonas aeruginosa.
Rationale: (Edited 17/12/2020 with correct interactions)
Resulting molecule:
I'm trying to extend its spectrum of activity based on the eNTRy criteria, quoted from the paper below:
From their study, there is evidence that adding a primary amine is not enough to extend the spectrum of activity of the compound, e.g. Vancomycin (has a primary amine, 13 rotatable bonds, Glob=0.28, Gram-positive only)
Questions I need help with:
I am trying to put together some data on the molecules I'm studying, but encountered some roadblocks:
Submitting the SMILES notation to the eNTRyway website did not work. (I.e. It gave me an error status) I suspected it could be that my SMILES notation is wrong but it also did not work for SMILES notation extracted from database for known antibiotics such as Ampicillin and Penicillin G. Does anyone have any experience with the eNTRyway website, perhaps they require a specific type of SMILES notation?
The paper used MOE to describe the structures' Glob and VSurfA but I do not have access to the software (also was not listed on my university's database). I tried to find open source alternatives, the closest I could find for Glob is the Spherocity Index from RDKit's Descriptors3D module. I could not confirm if Glob and Spherocity Index are interchangeable. They do use the same scale, 1 = perfect sphere and 0 = benzene. Does anyone have any ideas and happy to help with this?
I tried to glean as much information from DataWarrior's Molecular Descriptors function, and came up with this table:
_NB: AZ5595 is the hit compound, AZ5595DN1 is the molecule with primary amine added, AZ8074 is the chemical synthesis primer, and AZ9 is one of the compounds from the SAR studies with reduced potency.
AZ5595 has promising Spherocity Index but it has 7 rotatable bonds, which is more than the upper limit specified in the eNTRy criteria. The modified AZ5595 naturally will inherit the same number of rotatable bonds, and this time, with increased spherocity. I would really appreciate suggestions on alternative ways of adding the primary amine and meeting the eNTRy criteria from the trained eyes of OSA.
Thank you!
Relevant papers:
Update 28/10/2020:
Here's a table that summarises the molecules' structures and SMILES code generated from DataWarrior.
CC(C)(C)c1cc(Nc2c(cn[nH]3)c3nc(N[C@H](CO)c3ccccc3)n2)nn1C
CC(C)(C)c1cc(Nc2c(cn[nH]3)c3nc(NC(C(N)O)c3ccccc3)n2)nn1C
OCC(CCCC1)N1c1nc(Nc2n[nH]c(C3CC3)c2)c(cn[nH]2)c2n1
CC(C)[C@H](CO)Nc1nc(Nc2nn(C)c(C(C)(C)C)c2)c(cn[nH]2)c2n1
The SMILES notations seems to communicate well with CHEMBl's MarvinJs Import structure function and OpenBabel did detect them and converted into sdf files. However, I think I've generated 4 different SMILES notation for AZ5595_DN (generated by DataWarrior, ChemSketch and ChemBL search query). Not sure if there's an error with the sketched structure that is causing the problems? Tautomeric states and chirality perhaps?