This repository contains the materials for an introduction to AI tools for drug discovery, delivered at the 5th H3D Symposium in Livingstone, Zambia. The demo contained within this repository is only brought alive online during the workshop delivery. If you wish to use it for demo purposes, please email us at hello[at]ersilia[dot]io
You can find detailed information about the proposed demo in this page. Find below a quick summary!
The participant is presented with the following problem statement: We are a laboratory specialised in antimicrobial research. We have received a library of compounds from a collaborator and we need to explore it and identify the best candidates.
Some of the considerations to take into account:
A simple plug to GPT3.5 to ask questions around pathogens.
Train a simple classifier model based on the dataset for A.baumannii activity reported in Liu et al, 2023. The participants can play with different cut-offs and the demo performs an automated 5-fold cross-validation.
Participants will use a given dataset (prepared from ChEMBL) and run predictions using the just trained A.baumannii model as well as two models from the Ersilia Model Hub: Synthetic Accessibility Score (Ertl et al, 2009) and hERG cardiotoxicity (Jiménez-Luna et al, 2021). The goal is to select the best molecule according to the predicted values (high activity against A.baumannii, good synthetic accessibility and low cardiotoxicity).
Using the best candidate, try the MolMIM generator (Reidenbach et al, 2022) to obtain analogues with better drug-likeness (QED and LogP).
We use a quick search on Chem-Space to find which compounds are directly purchasable.
Requirements are listed in the Dockerfile. To run it locally:
streamlit run app/app.py
To run the app, the models will need to be deployed online/locally. The URL's for the models can be modified in the info.py
file.
In addition, we use a number of services that require an API (all of them offer free credits that should be sufficient for the usage in this demo). Please create an .env
file with the required API Keys.
The models used in this demo come from:
The materials for this workshop are distributed under a CCY-BY-4 License and the code is made available under a GPLv3 License. Please cite appropriately the Ersilia Open Source Initiative and the H3D Foundation when using our materials.
This is a prepared workshop. Data has been curated to facilitate the student's learnings and does not represent a real scenario.