geneontology / go-ontology

Source ontology files for the Gene Ontology
http://geneontology.org/page/download-ontology
Creative Commons Attribution 4.0 International
223 stars 40 forks source link

Query nucleobase-containing small molecule metabolic process #15250

Closed pgaudet closed 6 years ago

pgaudet commented 6 years ago

Hi @ValWood

It looks like you are the one who requested 'nucleobase-containing small molecule metabolic process'. It seems to have been intended as a grouping term for nucleobases, a nucleosides, and nucleotides; right ?

Just wondering because (1) I am trying to add logical definitions but I dont think ChEBI has the equivalent process, right? (2) the respective 'metabolic' and 'catabolic ' children don't have any children; I suppose you also want the grouping at that level ?

Thanks, Pascale

ValWood commented 6 years ago

That correct.

This grouping is a really useful slim module... https://www.pombase.org/term_genes/GO:0055086 It has 199 pombe annotations all involved in "nuclotide/side/bas metabolism". There is overlap between these processes.

If you move up to the parent GO:0006139 - nucleobase-containing compound metabolic process (so for pombe this has pombe 1494 annotations) so is way too broad. It has nucleic acid metabolic process (DNA metabolism, transcription, translation) and lots of other stuff. GO:0098501 polynucleotide dephosphorylation!

I'm not even sure what is in there ... but it isn't useful because it has such unrelated stuff.....

pgaudet commented 6 years ago

So I should add correct children to the biosynthetic/catabolic processes then?

ValWood commented 6 years ago

Oh yes they should be there!

deustp01 commented 6 years ago

And I guess Val wants to exclude metabolism of nucleotide sugars - a standard biochemistry distinction not made by ChEBI. Indeed, that looks like an important part of the problem here - the ChEBI classification scheme clashes with the GO functional classification. Comments, @ukemi ?

ValWood commented 6 years ago

Yes for sure!

deustp01 commented 6 years ago

Expand on the cryptic request to @ukemi - when we were thinking about glycolysis, didn't we decide that the logic required us to allow something like glycolysis is_a nucleotide metabolism because ATP is_a nucleotide. Val clearly wants to be less tolerant and inclusive, and this kind of problem is going to recur (for example because ATP usage is fairly widespread). Any thoughts about a general strategy?

ukemi commented 6 years ago

Yes. But I think that is a different question than the one being considered here. Based on our definition of metabolic processes, we had decided this was ok because any process that used a molecule was a type of metabolism of that molecule.

I think the issue here is the creation of a grouping class that includes only nucleobases, nucleosides and nucleotides. This class appears not to exist in ChEBI. It seems that this class isn't really based on chemical structure, but is more of a custom made class that represents a hand-picked set of processes. Perhaps this is a good argument for asserted subclasses.

alanbridge commented 6 years ago

Might GO consider classifying reactions based on their "main pairs" (a concept borrowed from KEGG) - as these might fit more closely with the biological aim - and set aside more common "hub" metabolites - that generally serve the same role in many reactions?

So for a reaction like this:

a cholesterol ester + H2O = a fatty acid + cholesterol + H(+)

(http://www.rhea-db.org/reaction?id=36403)

the main pairs might be:

a cholesterol ester - cholesterol a cholesterol ester - a fatty acid

So this is an example of cholesterol, cholesterol ester, and fatty acid metabolism. The function of the enzyme is to regulate the levels of these metabolites rather than water (although H2O is essential in this hydrolase of class EC 3.1.1.13).

This issue of "hub" metabolites is one that is often considered in the context of metabolic networks; people might make connections between reactions only through main pairs. Might this be of use in reaction (function) classification as well as connection?

ValWood commented 6 years ago

I like this grouping term for slimming because these 3 processes have a large number of genes in common. Some of the sub-processes of the second descendant (grand children) are more closely biologically connected to each other than those of the first descendant (children), which makes it a sensible module to include in a slim set.

grouping

Numbers are annotated genes, PomBase.

Alan's suggestion sounds interesting.

ValWood commented 6 years ago

Actually this term just cropped up again for a figure I am trying to prepare:

nucleobase-containing compound catabolic process (GO:0034655)

should be a child of "small molecule catabolic process"

currently it isn't. Could that be added too?

pgaudet commented 6 years ago

Hi @alanbridge as far as I understood your comment, GO is already doing that with logical definitions, any participating molecule should inherit x metabolic process; isn't it already the case ? @ukemi is this right ?

ukemi commented 6 years ago

Yes and No. We don't have many logical definitions for functions. But if it is a participant in a metabolic process it is reasoned to be a type of that metabolism. There are nuances that get very tricky. We should discuss strategies in NYC.