PyEED / pyeed

🧬 Toolkit to create, annotate, and analyze specialized sequence databases
https://pyeed.github.io/pyeed/
MIT License
3 stars 5 forks source link

Information from UniProt should be parsed via REST-API #10

Closed haeussma closed 7 months ago

haeussma commented 1 year ago

When initializing a ProteinSequence via the from_ncbi method from an u UniProt ID, information is incomplete. For instance, the coding sequence reference is missing.

UniProt offers a REST-API for protein entries, including information on protein activity, references to alphafoldBD, and GenBank.

I think the data model should be extended to capture these information and all references to other databases.

haeussma commented 7 months ago

solved in #56