geneontology / go-ontology

Source ontology files for the Gene Ontology
http://geneontology.org/page/download-ontology
Creative Commons Attribution 4.0 International
220 stars 40 forks source link

symbiont process #15497

Closed pgaudet closed 3 years ago

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 16:10

human has 1403 annotations to "symbiont process". I find these very misleading. The practice of annotating human genes to viral processes seems very strange to me full stop...

Copied from original issue: geneontology/go-annotation#1891

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 16:16

for example 600+ human genes

annotated to GO:0019058 viral life cycle Definition A set of processes which all viruses follow to ensure survival; includes attachment and entry of the virus particle, decoding of genome information, translation of viral mRNA by host ribosomes, genome replication, and assembly and release of viral particles containing the genome. Source: ISBN:1555811272

...makes it very difficult to identify/ analyse known/unknown human processes, these don't fit the proposed definition of a GO process

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 16:17

"A biological process represents a specific objective that the organism is genetically “programmed” to achieve. https://github.com/geneontology/go-ontology/issues/14899

pgaudet commented 6 years ago

We need to fix the ontology (there are lots of tickets). There are many children that are either dubious or at the wrong place. Examples:

Pascale

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 18:38

OK, we'll ignore these terms for our purposes...

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 18:38

you can close this if it's covered...

pgaudet commented 6 years ago

From @cmungall on March 28, 2018 19:36

Relevant ticket: https://github.com/geneontology/go-ontology/issues/14807

I think many of the part-of children of symbiont process should in fact be regulates.

pgaudet commented 6 years ago

From @ValWood on March 28, 2018 20:0

I think many of the part-of children of symbiont process should in fact be regulates.

I don't think that solves the problem.

Human genes are not "regulating" viral processes....virus are co-opting human processes for their own purposes..... A big difference.

pgaudet commented 6 years ago

From @cmungall on March 28, 2018 22:7

Ah sorry I realized I forgot to give context.

I just picked a random term with annotations under 'symbiont process', http://amigo.geneontology.org/amigo/term/GO:0070946, neutrophil mediated killing of gram-positive bacterium. (a bacterial one, not viral)

symb

There are a couple of mouse genes with experimental annotations to this. So this was bacterial and there isn't any co-option involved. So it's hopefully more straightforward, but in fact there are a number of issues here.

First is this:

id: GO:0051702
name: interaction with symbiont
relationship: part_of GO:0044403 ! symbiont process

This doesn't seem right but I'm not sure I fully understand the new definition of symbiont process (which is kind of an odd term no biologist would use, the only hits in google are to GO, or apparently "symbiont process engineer" is a job at hewlett packard). At least with a traditional definition of symbiosis, it's not the case that all interactions with the symbiont would be symbiotic interactions. For example, if my gene products enable some defense against a parasite, that isn't part of the general parasitism process. But it could be considered regulating the activities that are part of the parasitism.

but there is also a problem here:

id: GO:0070944
name: neutrophil mediated killing of bacterium
is_a: GO:0070943 ! neutrophil mediated killing of symbiont cell
...

AFAIK it's only a symbiont if there is some kind of long-term interaction. Is it considered a parasite if it's a short term opportunistic infection? I suppose arguments can be made that all bacteria on a host can be regarded somewhere on the spectrum commensal/mutual/parasite regardless of duration of interaction. But this then renders the concept of symbiont to be so broad there is little point having a distinction between symbiont process and the parent 'interspecies interaction between organisms', the only differentia is one of relative size.

pgaudet commented 6 years ago

From @ValWood on March 29, 2018 0:16

yes, but it should be possible to annotate a process to the host if it is an "evolved function". Humans aren't "programmed" to replicate viruses whatever the relationship between the two...

cmungall commented 3 years ago

It's not clear why this was moved from the annotation tracker to the ontology tracker. What is the change in the ontology that will lead to resolution here? Can we discuss this on a multi-species call?

This seems like an annotation or a mapping issue

Just focusing on human, genes that are involved in symbiont process:

http://amigo.geneontology.org/amigo/search/annotation?q=*:*&fq=isa_partof_closure:%22GO:0044403%22&sfq=document_category:%22annotation%22

(I use the term involved in in the strict gp2term sense)

image

There is a lot of pretty vague (not very useful?) annotations coming in from SP annotations to:

https://www.uniprot.org/keywords/KW-0945 Host-virus interaction ==>

id: GO:0016032
name: viral process
namespace: biological_process
alt_id: GO:0022415
def: "A multi-organism process in which a virus is a participant. The other participant is the host. Includes infection of a host cell, replication of the viral genome, and assembly of progeny virus particles. In some cases the viral genetic material may integrate into the host genome and only subsequently, under particular circumstances, 'complete' its life cycle." [GOC:bf, GOC:jl, GOC:mah]
comment: See also the biological process terms 'viral infectious cycle ; GO:0019058' and 'lysogeny ; GO:0030069'.
subset: goslim_metagenomics
subset: goslim_pir
synonym: "viral infection" RELATED []
synonym: "virulence" RELATED []
synonym: "virus process" EXACT [GOC:bf, GOC:jl]
xref: Wikipedia:Viral_life_cycle
is_a: GO:0044403 ! symbiotic process

Are we sure this mapping is valid? Or maybe a link from the vague 'viral process' to symbiotic process is not valid? The combination seems to weaken the concept of 'symbiont process' to be less useful? Is the virus even in an 'intimate' relationship with its host? I would appreciate the opinions of viral ecologists here.

Other annotations seem to be cases of hijacked host functions, e.g from:

http://amigo.geneontology.org/amigo/reference/PMID:21501828 http://amigo.geneontology.org/amigo/reference/PMID:14963118

Are these meaningfully symbiotic?

mgiglio99 commented 3 years ago

So, I think a lot of confusion got introduced when the original PAMGO "symbiosis" term got changed to "symbiont process" - which was not the same concept at all. That change broke a lot of the relationships that PAMGO had established. What it should have been, and has since been changed to, is "symbiotic process". Both "interaction with host" and "interaction with symbiont" should be is_a children of "symbiotic process". However, currently "interaction with symbiont" is not - that one is messed up and fixing that is in my proposed changes to this node that is documented in https://github.com/geneontology/go-ontology/issues/19431 That has been on the multiorganism to-do list for a long time (along with a lot of other stuff.)

@cmungall , above you flip back and forth between saying "symbiont process" and "symbiotic process" - there is no "symbiont process" anymore so we need to be careful in talking about these. Those two terms do not mean the same thing at all. But as I said above,the fact that "symbiont process" ever existed where "symbiosis" had been, messed a lot of things up.

On Chris's specific Keyword mapping question: I don't think "host-virus interaction" should be mapped to "viral process". I'm sure that results in a lot of host proteins being erroneously given annotations for viral things. Part of the problem is that some terms got rearranged and added during the time that "symbiont process" existed - but then, as I said above, "symbiont process" became "symbiotic process" (which is what is should be) causing yet more confusion.

Another problem is that the viral terms were developed in a whole separate effort from the PAMGO terms - so there are clashes and inconsistencies there.

Regarding the "viral process" term specifically, I actually don't think this term should exist - it would be like having a term "bacterial process" or "plant process". I'm guessing that the viral terms under "viral process" should have homes in the "regular" process nodes or under "interaction with host". But I haven't really explored them enough to say for sure.

mgiglio99 commented 3 years ago

Tagging @nsuvarnaiari and @dsiegele so they can be part of the thread.

cmungall commented 3 years ago

@mgiglio99:

pgaudet commented 3 years ago

This is now resolved by having 'biological process involved in symbiotic interaction'