seanrjohnson / protein_scoring

Generating and scoring novel enzyme sequences with a variety of models and metrics
MIT License
58 stars 11 forks source link

Protein Scoring on Google Colab #3

Open nkim23nyk6 opened 7 months ago

nkim23nyk6 commented 7 months ago

Hi Sean,

I've been using the Protein Metrics on Google Colab for the past few days, but for some reason I keep getting errors when I try to calculate scores using ESA-MSA sampler-generated sequences.

The error occurs after I upload a list of sequences and run the Single-sequence metrics.

I'm not too familiar with how codes work on Google Colab, so I was wondering if you would be able to resolve the above issue.

Thanks! Nam

seanrjohnson commented 7 months ago

What metrics are you trying to calculate? What errors are you seeing?

I just did a quick test run. Downloaded "sequences.zip" from here: https://zenodo.org/records/10594384 extracted CuSOD_round2_train.fasta, copied the first 6 sequences into a new file called templates.fasta, used those two files in the sequence generation notebook, and then used the output in the metrics notebook. I only did the single sequence and alignment based metrics, because I didn't want to make AlphaFold structures, but everything seemed to work.