samsledje / D-SCRIPT

A structure-aware interpretable deep learning model for sequence-based prediction of protein-protein interactions
http://dscript.csail.mit.edu
MIT License
87 stars 18 forks source link

D-SCRIPT

D-SCRIPT Architecture

D-SCRIPT PyPI DOI Documentation Status License Code style: black CodeFactor

D-SCRIPT is a deep learning method for predicting a physical interaction between two proteins given just their sequences. It generalizes well to new species and is robust to limitations in training data size. Its design reflects the intuition that for two proteins to physically interact, a subset of amino acids from each protein should be in con-tact with the other. The intermediate stages of D-SCRIPT directly implement this intuition, with the penultimate stage in D-SCRIPT being a rough estimate of the inter-protein contact map of the protein dimer. This structurally-motivated design enhances the interpretability of the results and, since structure is more conserved evolutionarily than sequence, improves generalizability across species.