seanrjohnson / protein_scoring

Generating and scoring novel enzyme sequences with a variety of models and metrics
MIT License
58 stars 11 forks source link

Computational Scoring and Experimental Evaluation of Enzymes Generated by Neural Networks

Colab notebooks

Notebook Description
ESM-MSA sampler uses the ESM-MSA model (a transformer-based neural network trained on protein multiple sequence alignments) to generate new protein sequences by iteratively mutating sequences from an input alignment.
Metrics Calculates various sequence- and structure-based quality scores for proteins, such as those produced by generative models.

Figures

Setup

conda env create --name protein_scoring -f conda_env.yml

jupyter lab

Related data and repositories

References