Open siiraa opened 7 years ago
A few thoughts about this:
This ticket is currently in our Icebox, but when we come back to it, we may want to consider how Mondo is approaching representation of responses to drugs (https://github.com/monarch-initiative/mondo/issues/860).
Update: the Mondo ticket has now been addressed, so I'll move this to Backlog, and when we have a chance, we could examine Mondo's approach and evaluate adopting it for EFO too.
Updating this ticket to document recent discussion (on the GO tracker) re. obsoletion of 'response to drug' GO terms, some of which are used in EFO by GWAS curators: https://github.com/geneontology/go-ontology/issues/21998
@paolaroncaglia @zoependlington @matentzn @eks-ebi @dosumis Update: We are reviewing the representation and axiomatisation of all 'response to' terms currently in EFO. These comprise terms that were requested and imported from GO and terms that were generated in EFO as per the GWAS catalog and OTAR needs. The GO superclass is "response to stimulus" GO:0050896 with Subclasses "response to chemical" GO:0042221 of which "response to drug" GO:0042493 is a Subclass Of. We aim to align between EFO, OBA, GO, VT and CHEBI. EFO 'response to' terms that aren't in GO will be added to OBA. This is in line with MONDO 's approach: obsoleted all response to terms and added them in OBA (monarch-initiative/mondo#860). We will do a templated addition to OBA + xref/SSSOM mapping from EFO to OBA.
@dosumis @zoependlington @matentzn Work in this ticket should also address https://github.com/EBISPOT/efo/issues/166 We should probably be stripping is_about links to CHEBI compounds as part of work on this branch. ('has input' is the correct relation - based on GO schema).
@matentzn @dosumis
Just an update on the manual mapping approach and the slight modification to the instructions that Nico and I discussed. Briefly we discussed adding a couple of columns for the GO nearest parent and its label. This is because (for the terms that weren't in OBA or in VT) I was originally asked to add either response to stimuli or response to chemical under go_class; but in many instances the nearest parent in GO is more specific e.g. GO:0097334 - response to perphenazine has a nearest parent in GO response to antipsychotic drug. Therefore for this term the general go_class_label is response to chemical but the nearest_go_parent_label is response to antipsychotic drug.
For some other terms in GO there is a mutli-hierarchy of nearest parents so to reflect I have added all nearest parents separated by "|". e.g. GO:0097335 - response to quetiapine has two parental classes: response to organonitrogen compound | response to organic cyclic compound
In some instances and usually for EFO terms the nearest parents differ between GO and EFO (which I presume is because there was an attempt in EFO to introduce some structure which is not mirrored in GO) . e.g. EFO:0010103 - response to peginterferon alfa-2a has a nearest parent response to interferon in EFO. This term is not represented in GO but if it was it would be under nearest parent 'response to drug'. therefore for these type of terms I am still adding go_class 'response to chemical' and go_nearest parent 'response to drug' and I am documenting the EFO parents in the Comment box.
Nearest parents can also differ between EFO and GO for terms that are the same e.g. EFO:0007592 - response to bleomycin GO:1904975 - response to bleomycin. Nearest parent in EFO is GO:0097327 (response to antineoplastic agent) whereas nearest parent in GO is GO:1901652 (response to peptide)
Some of the response to chemical terms are so old that I was the creator of them. I think the nearest parents are an indicator of divergence between GO and EFO over time.
Thanks, yes indeed, for the last example above GO added response to peptide a year ago.
Please don't constrain the modelling based on our previous requests (e.g. #1209). We are quite keen to get this information organised in EFO and we will adapt to whatever conclusion you reach.
As mentioned before, we have several datasets with response or resistance to drugs very often in the context of a disease: GWAS catalogue, ClinVar, Cancer Genome Interpreter, etc. Our biggest concern is to find an approach that scales with the amount of data generated.
@rays22 see comments for response to terms
As a general rule, if a drug is available in Chebi, the corresponding 'response to' term should be created in GO and imported into EFO. Review all subclasses of response to drug that currently have an EFO ID to see if an appropriate GO term could be generated to replace them.