webyrd / mediKanren

Proof-of-concept for reasoning over the SemMedDB knowledge base, using miniKanren + heuristics + indexing.
MIT License
323 stars 53 forks source link

Support queries to find proteins with certain domains #13

Open webyrd opened 6 years ago

webyrd commented 6 years ago

For example, find all human proteins that contain a specific domain

(Feature request from a clinical researcher.)

webyrd commented 6 years ago

Andrew Su points out that InterPro is very good at this sort of thing:

http://www.ebi.ac.uk/interpro/

The full InterPro data sources can be downloaded--some of the files seem quite large (40GB compressed for the largest file).