JacksonH44 / PPI-prediction

MIT License

Generate report of lengths of proteins and protein complexes #36

Closed JacksonH44 closed 2 months ago

JacksonH44 commented 2 months ago

What is the context of the feature? Generate a file listing all proteins and protein complexes in the dataset, along with their lengths.

Describe the solution Call the Ensembl API to get the length of each protein, compute the length of each complex, and sort the proteins and complexes from shortest to longest. This ensures that the shortest sequences are run through ColabFold first, so they aren't stuck waiting behind longer sequences in the job queue.
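
A rough sketch of how this could look in Python, not the code that ended up in the repo. It assumes the dataset identifies proteins by Ensembl transcript or protein IDs, that the Ensembl REST `sequence/id` endpoint with `type=protein` is used to get each sequence, and that a complex's length is the sum of its members' lengths. The function names `fetch_protein_length` and `build_length_report` are made up for illustration.

```python
import json
import urllib.request

ENSEMBL_REST = "https://rest.ensembl.org"


def fetch_protein_length(ensembl_id):
    """Return the amino-acid length for an Ensembl transcript or protein ID.

    Assumption: the ID resolves to a single protein sequence.
    """
    url = f"{ENSEMBL_REST}/sequence/id/{ensembl_id}?type=protein"
    request = urllib.request.Request(url, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return len(json.load(response)["seq"])


def build_length_report(proteins, complexes, get_length=fetch_protein_length):
    """Return (name, length) rows sorted from shortest to longest.

    proteins:   list of protein IDs
    complexes:  list of tuples of protein IDs, one tuple per complex
    get_length: callable mapping an ID to its length (swappable for testing)
    """
    # Look up each distinct ID once, including IDs that only appear in complexes.
    ids = set(proteins)
    for members in complexes:
        ids.update(members)
    lengths = {pid: get_length(pid) for pid in sorted(ids)}

    rows = [(pid, lengths[pid]) for pid in proteins]
    # Assumption: complex length = sum of member lengths; name joins IDs with ':'.
    rows += [(":".join(members), sum(lengths[p] for p in members)) for members in complexes]

    # Shortest first; ties broken by name so the order is deterministic.
    return sorted(rows, key=lambda row: (row[1], row[0]))
```

Passing a different `get_length` (for example the `__getitem__` of a dict of cached lengths) lets the sorting be checked without hitting the API. The sorted rows could then be written out with the `csv` module and used to order the ColabFold submissions.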

JacksonH44 commented 2 months ago

Resolved by #37