Closed dustine32 closed 1 year ago
@ukemi I'm clueless on ythis EC question - I'd tag Harold but forget his tag.
Thanks @deustp01! Paging @hdrabkin on this!
@dustine32 it depends (doesn't it always?). When this file was first made over 20 years ago, the goal was to simply generate GO annotations for genes that had EC cross references at resources like MGI. With that goal in mind, some EC identifiers could be mapped to CC and BP terms if the enzyme described in the EC entry was enabled by a gene product and we could safely say that the gene product was part of a complex that carried out that MF or was always a part of a BP. The file has probably evolved a lot since we first made it, but that was the original intent. That said, for your purposes I would suggest that you only take EC->MF information for the translation. At the end of the day, since we are now including GP to Term relationships in all annotations we should rethink how these translation files are constructed. I think it would be prudent to include those relations if the files are still of value. @vanaukenk don't you think?
Ah, OK. I will just be careful and heavily comment that I'm filtering for EC-mapped MF terms only. No problem!
Thanks for the response @ukemi!
Closing since the fix has been merged for a while and running in the latest batch of YeastPathways models.
Use case: SGD pathway - [formaldehyde oxidation II (glutathione-dependent)](https://pathway.yeastgenome.org/YEAST/NEW-IMAGE?type=NIL&object=PWY-1801-1&redirect=T) Reaction 1.2.1.2-RXN has EC number 1.17.1.9, which is mapped to both formate dehydrogenase (NAD+) activity (GO:0008863) and formate dehydrogenase complex (GO:0009326). Of these two GO terms, only GO:0008863 is a function. The pathway conversion code should filter out any non-function terms to prevent invalid OWL situations like this: Where a formate dehydrogenase complex -part_of-> biological process.
@ukemi @deustp01 We probably discussed this already, but can you clarify whether or not EC's should ever be mapped to non-molecular function GO terms? If they are supposed to be MF-only, I will place this code change where it will be more widely used.