rvanasa / deep-antibody

COVID-19 monoclonal antibody screening research.
0 stars 0 forks source link

Find optimal docking parameters using Hex #2

Open rvanasa opened 4 years ago

rvanasa commented 4 years ago

Docking simulators give a useful although rather janky metric for how an antibody would potentially bind to the target antigen. Most of these tools generate thousands of possible configurations and then leave the user to perform some sort of post-processing such as energy minimization to retrieve useful information from the simulation.

We are focusing on three different docking tools:

Each of these has its own benefits and drawbacks. Hex is very fast and relatively easy to use; Frodock is state-of-the art but somewhat slower; and ZDock is well-established but relatively inaccurate and slow.

The goal is to create a pipeline where we can automatically dock antibodies to our target and evaluate how much the three simulators agree with each other. Configurations with the most agreement across simulations have been demonstrated to be much more reliable in their contact predictions.

The idea is that we feed the contact points from this pipeline into the deep learning model, which returns a score based on how strong the docking configuration would be in real life.

Fine-tuning the docking simulations is going to require learning how proteins and ligands interact with each other, so this is a perfect entry point for anyone who wants to dig into the microbiological side of the project.

How to start:

Common terminology:

Note that in most simulators, the antibody is called the "receptor" while the antigen is called the "ligand." If you need any other clarification on terminology, be sure to let me know since other people will probably run into it as well.

Here's a bit of documentation for Hex: http://hex.loria.fr/manual800/hex_manual.pdf

imcsherry commented 4 years ago

Currently learning to use Hex. I've done a little bit of messing around and didn't get super far on anything productive - hopefully I can have some useful data in the next couple days

rvanasa commented 4 years ago

Awesome; thanks for having a go at it! Definitely keep me posted.

Ideally, we want to find a way to record the most common contact points for all the 1000+ docking poses that Hex generates on each run. That way, we can overlap this with other systems to avoid predicting weird contact points (like in the center of the antigen, etc).

Let me know if you find something related to this, or if you come across anything else interesting / cool in the software.