opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0
12 stars 2 forks source link

Create biosample index to resolve Tissue/cell type entities #3445

Closed d0choa closed 2 months ago

d0choa commented 2 months ago

As discussed with @Tobi1kenobi and @DSuveges, we want to create an index containing all the valid tissue/cell type entities for the Open Targets Platform. This will help with validation and resolve labels, ontology, etc.

The most immediate use case is to resolve all the biosample identifiers' eQTL catalogue studies at the API level. In the future, this data can be used for many other use cases: baseline expression, literature-mining, etc. Currently, the QTL study index contains the biosampleId (e.g. UBERON code), but we will rely on the index to retrieve labels or any additional data.

Screenshot 2024-09-06 at 10 51 38

A few tasks:

Tobi1kenobi commented 2 months ago

This has taken a little longer than expected as I initially wrote everything for owl but have since shifted to JSON ontology files. But the code should all be in https://github.com/opentargets/gentropy/compare/alegbe-biosample_index

Would like to test it against the study index currently in gentropy before I make a pull request but unsure of best way to do this

addramir commented 2 months ago

Is it related to tissue/cell line catalogue we previously discussed?

Tobi1kenobi commented 2 months ago

Is it related to tissue/cell line catalogue we previously discussed?

I think so. Long-term it would be great to have unified biosample pages which gather evidence across variants, genes, studies, etc in the same way the variant, gene and study pages do

And an index is certainly necessary for that