OpenSourceMalaria / Series4_PredictiveModel

Can we Predict Active Compounds in OSM Series 4?
Genetic algorithm optimisation of Series 4 potency #32

Open jhjensen2 opened 3 years ago

jhjensen2 commented 3 years ago

The goal is to use my graph-based genetic algorithm (GA) to maximise pIC50 values predicted using machine learning (ML). Here are some preliminary results.

ML model 1 (ML1)

GA searches for molecules with large pIC50 values

[Screenshot 2021-03-19 at 15.29.51] [Screenshot 2021-03-19 at 15.31.51]
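For readers unfamiliar with the approach, here is a minimal sketch of a GA loop that maximises a score. It is not the graph-based GA used in this issue: molecules are replaced by fixed-length strings, and `toy_score` is a hypothetical placeholder for the ML-predicted pIC50 with no chemical meaning. The names `run_ga`, `mutate`, `crossover` and `random_population` are invented for illustration.

```python
import random

ALPHABET = "CNOF"  # toy "atoms"; strings stand in for molecular graphs


def toy_score(mol):
    """Hypothetical stand-in for an ML-predicted pIC50 (higher is better)."""
    return mol.count("N")


def mutate(mol, rng):
    """Replace one randomly chosen character."""
    i = rng.randrange(len(mol))
    return mol[:i] + rng.choice(ALPHABET) + mol[i + 1:]


def crossover(a, b, rng):
    """One-point crossover: head of one parent, tail of the other."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def random_population(n, length, seed=0):
    rng = random.Random(seed)
    return ["".join(rng.choice(ALPHABET) for _ in range(length)) for _ in range(n)]


def run_ga(score, mutate, crossover, population,
           generations=20, mutation_rate=0.5, seed=0):
    """Return the final population, sorted best first."""
    rng = random.Random(seed)

    def rank(mols):
        # Deduplicate, then sort by score (ties broken alphabetically).
        return sorted(set(mols), key=lambda m: (-score(m), m))

    population = rank(population)
    size = len(population)
    for _ in range(generations):
        # Parents are sampled with probability proportional to shifted score.
        scores = [score(m) for m in population]
        low = min(scores)
        weights = [s - low + 1e-9 for s in scores]
        children = []
        for _ in range(size):
            a, b = rng.choices(population, weights=weights, k=2)
            child = crossover(a, b, rng)
            if rng.random() < mutation_rate:
                child = mutate(child, rng)
            children.append(child)
        # Elitism: keep the best unique individuals from parents and children.
        population = rank(population + children)[:size]
    return population


start = random_population(20, 8, seed=1)
final = run_ga(toy_score, mutate, crossover, start, generations=30, seed=1)
print(final[0], toy_score(final[0]))
```

Because parents compete with their children for a place in the next generation, the best score in the population can never decrease from one generation to the next.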

How should we select the best molecules, if any, for further study?

edwintse commented 3 years ago

Hi @jhjensen2, my gut feeling from the compounds above is that adding another phenyl ring to either side of the molecule will end up decreasing the potency. It would be great if you could add filters for solubility and synthetic accessibility to see what kind of compounds you get after that.
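One way such filters could be applied is as a hard cut on the GA output before ranking by predicted potency. The sketch below assumes each candidate already carries a predicted solubility (`log_s`) and a synthetic accessibility score (`sa_score`, on the usual 1 = easy to 10 = hard scale). The `Candidate` class, the `shortlist` function, both thresholds and all example values are hypothetical and chosen only for illustration; they are not results from this project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    pic50: float     # ML-predicted potency
    log_s: float     # predicted aqueous solubility, log10(mol/L)
    sa_score: float  # synthetic accessibility, 1 (easy) to 10 (hard)


def shortlist(candidates, min_log_s=-5.0, max_sa=4.0, top_n=10):
    """Drop poorly soluble or hard-to-make candidates, then rank by pIC50.

    The default thresholds are illustrative assumptions, not recommendations.
    """
    kept = [c for c in candidates
            if c.log_s >= min_log_s and c.sa_score <= max_sa]
    return sorted(kept, key=lambda c: c.pic50, reverse=True)[:top_n]


# Made-up values, purely to show the filter in action.
candidates = [
    Candidate("mol_A", pic50=7.9, log_s=-6.5, sa_score=3.0),  # too insoluble
    Candidate("mol_B", pic50=7.1, log_s=-4.2, sa_score=2.8),
    Candidate("mol_C", pic50=6.4, log_s=-3.9, sa_score=5.5),  # too hard to make
    Candidate("mol_D", pic50=6.8, log_s=-4.8, sa_score=3.9),
]

for c in shortlist(candidates):
    print(c.name, c.pic50)
```

An alternative is to fold the same criteria into the GA's scoring function as penalties, so the search itself is steered away from insoluble or hard-to-make molecules rather than having them discarded afterwards.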