macrocosm-os / folding

Decentralized Protein Folding Bittensor Subnet
https://www.macrocosmos.ai/sn25
MIT License
14 stars 16 forks source link

How many proteins are eligible for simulation? #19

Closed steffencruz closed 4 months ago

steffencruz commented 6 months ago

How can we know if a pdb is going to be a good candidate for simulation? Are the initial conditions that stop us from being able to simulate? Estimate the total number of eligible candidates.

schampoux commented 6 months ago

There are indeed initial conditions that stop us from being able to simulate with the intent of getting good experimental results.

Not all PDB files can be simulated using GROMACS to obtain good results. The following are concerns that contribute to the suitability of a PDB file for simulation:

schampoux commented 6 months ago

It may be of interest to restrict ourselves to a specific type of molecule, therefore limiting force field selection. With this approach, we will be limiting guess work for selecting hyperparameters. Rodrigo will be providing us with subsets of PDB files that have similar configurations.

schampoux commented 6 months ago

If the goal is energy minimization in the simulations and we wish to switch out PDB files with minimal adjustments to hyperparameters, structural similarity is the most relevant criterion.

  1. Consistent Force Fields: likely to be suitable for one another because they will have similar 3d conformations and therefore similar physical and chemical environments for the atoms involved. The following parameters would not need significant adjustments: van der Waals forces, electrostatic interactions, and bond energies.
  2. Minimal Setup Changes: Allows us to maintain similar simulation conditions (e.g. solvation parameters, ionic strength, temperature)
  3. Relevance to Functional Insights.