EBISPOT / efo

Github repo for the Experimental Factor Ontology (EFO)
https://www.ebi.ac.uk/efo/
54 stars 14 forks source link

Review response to drug hierarchy for GO vs EFO terms #32

Open siiraa opened 6 years ago

siiraa commented 6 years ago

As a general rule, if a drug is available in Chebi, the corresponding 'response to' term should be created in GO and imported into EFO. Review all subclasses of response to drug that currently have an EFO ID to see if an appropriate GO term could be generated to replace them.

paolaroncaglia commented 5 years ago

A few thoughts about this:

paolaroncaglia commented 4 years ago

This ticket is currently in our Icebox, but when we come back to it, we may want to consider how Mondo is approaching representation of responses to drugs (https://github.com/monarch-initiative/mondo/issues/860).

paolaroncaglia commented 4 years ago

Update: the Mondo ticket has now been addressed, so I'll move this to Backlog, and when we have a chance, we could examine Mondo's approach and evaluate adopting it for EFO too.

paolaroncaglia commented 2 years ago

Updating this ticket to document recent discussion (on the GO tracker) re. obsoletion of 'response to drug' GO terms, some of which are used in EFO by GWAS curators: https://github.com/geneontology/go-ontology/issues/21998

kallia-p commented 2 years ago

@paolaroncaglia @zoependlington @matentzn @eks-ebi @dosumis Update: We are reviewing the representation and axiomatisation of all 'response to' terms currently in EFO. These comprise terms that were requested and imported from GO and terms that were generated in EFO as per the GWAS catalog and OTAR needs. The GO superclass is "response to stimulus" GO:0050896 with Subclasses "response to chemical" GO:0042221 of which "response to drug" GO:0042493 is a Subclass Of. We aim to align between EFO, OBA, GO, VT and CHEBI. EFO 'response to' terms that aren't in GO will be added to OBA. This is in line with MONDO 's approach: obsoleted all response to terms and added them in OBA (monarch-initiative/mondo#860). We will do a templated addition to OBA + xref/SSSOM mapping from EFO to OBA.

Template: https://docs.google.com/spreadsheets/d/13qh7dLE38vMyz91oRqj6GzKjFohNazNKAJxKG6Plw1o/edit#gid=213599470

Task 1: Manual mappings |   |   -- | -- | --   | 1 | Search on OLS (https://www.ebi.ac.uk/ols/ontologies/oba) whether suitable OBA term   | 2 | if suitable OBA term add OBA id in oba_id column, and OBA label in object_label)   | 3 | if not suitable OBA term, search OLS in VT (https://www.ebi.ac.uk/ols/ontologies/vt)   | 3a | if suitable VT term add VT id in vt_id column, and VT label in object_label)   | 4 | if no suitable OBA or VT term, create new OBA id in general_response table   | 4a | Search on GO on OLS (https://www.ebi.ac.uk/ols/ontologies/go) whether suitable response term   | 4b | if suitable GO response term, add to go_class column   |   | if no suitable GO response term, add GO:0042221 (response to chemical) or GO:0050896 (response to stimulus) to go_class column   | 5 | Look for a suitable CHEBI term on OLS (https://www.ebi.ac.uk/ols/ontologies/chebi)   | 5a | If CHEBI term, add to "chemical_entity" column   | 5b | If no CHEBI term, check PRO term on OLS (https://www.ebi.ac.uk/ols/ontologies/pr)   | 5c | If PRO term, add to chemical_entity column   | 6 | If chemical_entity is a drug used for treaments, write "yes" in the "drug" column We will also consider adding broad 'response to' subclasses in order to differentiate between 'toxicity/adverse effects' and 'response to treatment with' in line with the GWAS cat, OTAR and ClinVar (see monarch-initiative/mondo#860). OTAR requests top level terms terms such as drug resistance, drug toxicity which they wanted to be in the biological mechanism branch https://github.com/ebispot/efo/issues/1209
kallia-p commented 2 years ago

@dosumis @zoependlington @matentzn Work in this ticket should also address https://github.com/EBISPOT/efo/issues/166 We should probably be stripping is_about links to CHEBI compounds as part of work on this branch. ('has input' is the correct relation - based on GO schema).

kallia-p commented 2 years ago

@matentzn @dosumis

Just an update on the manual mapping approach and the slight modification to the instructions that Nico and I discussed. Briefly we discussed adding a couple of columns for the GO nearest parent and its label. This is because (for the terms that weren't in OBA or in VT) I was originally asked to add either response to stimuli or response to chemical under go_class; but in many instances the nearest parent in GO is more specific e.g. GO:0097334 - response to perphenazine has a nearest parent in GO response to antipsychotic drug. Therefore for this term the general go_class_label is response to chemical but the nearest_go_parent_label is response to antipsychotic drug.

For some other terms in GO there is a mutli-hierarchy of nearest parents so to reflect I have added all nearest parents separated by "|". e.g. GO:0097335 - response to quetiapine has two parental classes: response to organonitrogen compound | response to organic cyclic compound

In some instances and usually for EFO terms the nearest parents differ between GO and EFO (which I presume is because there was an attempt in EFO to introduce some structure which is not mirrored in GO) . e.g. EFO:0010103 - response to peginterferon alfa-2a has a nearest parent response to interferon in EFO. This term is not represented in GO but if it was it would be under nearest parent 'response to drug'. therefore for these type of terms I am still adding go_class 'response to chemical' and go_nearest parent 'response to drug' and I am documenting the EFO parents in the Comment box.

Nearest parents can also differ between EFO and GO for terms that are the same e.g. EFO:0007592 - response to bleomycin GO:1904975 - response to bleomycin. Nearest parent in EFO is GO:0097327 (response to antineoplastic agent) whereas nearest parent in GO is GO:1901652 (response to peptide)

helenp commented 2 years ago

Some of the response to chemical terms are so old that I was the creator of them. I think the nearest parents are an indicator of divergence between GO and EFO over time.

kallia-p commented 2 years ago

Thanks, yes indeed, for the last example above GO added response to peptide a year ago.

d0choa commented 2 years ago

Please don't constrain the modelling based on our previous requests (e.g. #1209). We are quite keen to get this information organised in EFO and we will adapt to whatever conclusion you reach.

As mentioned before, we have several datasets with response or resistance to drugs very often in the context of a disease: GWAS catalogue, ClinVar, Cancer Genome Interpreter, etc. Our biggest concern is to find an approach that scales with the amount of data generated.

kallia-p commented 2 years ago

@rays22 see comments for response to terms