OpenBioML / chemnlp

ChemNLP project
MIT License

Add Open Targets datasets for drug information #138

Open jackapbutler opened 1 year ago

jackapbutler commented 1 year ago

Summary

The Open Targets database has multiple interesting datasets that could be incorporated into this project.

We can also enrich these datasets with additional SMILES strings (or other molecule metadata) using this API endpoint, replacing the given ID with the ChEMBL IDs in this dataset.
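A minimal sketch of that enrichment step, for illustration only. The endpoint linked in the original issue is not preserved here, so this assumes the public ChEMBL REST API molecule resource and its `molecule_structures.canonical_smiles` field; the helper names `molecule_url`, `extract_smiles` and `fetch_smiles` are hypothetical and not part of chemnlp.

```python
import json
from typing import Optional
from urllib.request import urlopen

# Assumption: the public ChEMBL REST API molecule resource.
# The endpoint referenced in the issue itself may differ.
CHEMBL_MOLECULE_URL = "https://www.ebi.ac.uk/chembl/api/data/molecule/{chembl_id}.json"


def molecule_url(chembl_id: str) -> str:
    """Build the lookup URL for a single ChEMBL ID (hypothetical helper)."""
    return CHEMBL_MOLECULE_URL.format(chembl_id=chembl_id)


def extract_smiles(record: dict) -> Optional[str]:
    """Pull the canonical SMILES out of a molecule record, if one is present.

    Some records (e.g. biologics) have no structure, so the
    `molecule_structures` entry may be missing or null.
    """
    structures = record.get("molecule_structures") or {}
    return structures.get("canonical_smiles")


def fetch_smiles(chembl_id: str, timeout: float = 10.0) -> Optional[str]:
    """Fetch one molecule record over HTTP and return its SMILES, or None."""
    with urlopen(molecule_url(chembl_id), timeout=timeout) as response:
        return extract_smiles(json.load(response))
```

For a whole dataset, `fetch_smiles` could be mapped over the column of ChEMBL IDs, ideally with caching and rate limiting, since one request per molecule adds up quickly.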

Subtasks

jackapbutler commented 1 year ago

The discussion is related to how we choose to handle datasets in which a list of identifiers is associated with a single property.

e.g. drugs X, Y and Z were associated with disease A
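To make the two options concrete, here is a hedged sketch of the representations in question: one record per property holding a list of identifiers, versus one record per (identifier, property) pair. The field names `drug_ids` and `disease` and both helper functions are hypothetical, chosen only for illustration, and do not reflect a decision made in this thread.

```python
def explode_pairs(records):
    """Turn one-record-per-disease data into one record per (drug, disease) pair."""
    pairs = []
    for record in records:
        for drug_id in record["drug_ids"]:
            pairs.append({"drug_id": drug_id, "disease": record["disease"]})
    return pairs


def group_by_disease(pairs):
    """Inverse view: collect the drug identifiers associated with each disease."""
    grouped = {}
    for pair in pairs:
        grouped.setdefault(pair["disease"], []).append(pair["drug_id"])
    return grouped


# Placeholder identifiers taken from the example above.
records = [{"disease": "A", "drug_ids": ["X", "Y", "Z"]}]
pairs = explode_pairs(records)
```

The exploded form gives one training example per drug, while the grouped form keeps the full list available for prompts that ask for all drugs associated with a disease.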