radical-collaboration / IMPRESS

Integrated Machine-learning for PRotEin Structures at Scale
https://radical-project.github.io/impress
1 stars 0 forks source link

Integrated Machine-learning for PRotEin Structures at Scale (IMPRESS)

IMPRESS is a high-performance computational framework to enable the inverse design of proteins using Foundation Models such as AlphaFold and ESM2.

IMPRESS pipeline design

Current implementation is capable of running ProteinMPNN on an input two-chain complex to redesign the entire receptor, creating new interactions to the substrate and solubilizing the protein. These designs are then submitted to AlphaFold-Multimer for structure prediction, where AlphaFold’s intrinsic discriminatory power and confidence metrics can be leveraged to determine if the input protein has improved. The predicted structures are then resubmitted to ProteinMPNN to continue optimizing the receptor. Over the course of many iterations, we expect the receptor to gradually improve, both in terms of stability and substrate binding affinity. The pipeline monitors this process, distributing the sequence generation, structure determination, and design analysis tasks evenly across the specified GPU and CPU allocations.

Middleware foundation

RADICAL-EnTK (Ensemble Toolkit) is used as a workflow management system, that enables the use of high-performance computing (HPC) platforms and features. EnTK-based designed pipeline provides the integrated ability to efficiently and effectively train sophisticated models. That requires advances in HPC workflow methodology that brings together the ability to “evaluate as you go”.

Resources

To learn more, please visit the project website at https://radical-project.github.io/impress/