geneontology / go-ontology

Source ontology files for the Gene Ontology
http://geneontology.org/page/download-ontology
Creative Commons Attribution 4.0 International
220 stars 40 forks source link

Are more terms needed to describe role of lncRNAs? #23028

Closed RLovering closed 2 years ago

RLovering commented 2 years ago

Edited to add proposed annotations for each lncRNA

In progress - Hi

Figure 2 of PMID:31048766 describes 13 functions of lncRNAs. If the MFs for these RNAs are not available then other terms will be used that are perhaps not appropriate or no MF will be assigned. Based on figure 2 I have tried to find MF terms that could be used for each description and noted the missing terms below. Also to work out how many of these different roles would be captured with the same GO terms and which would use different terms and perhaps need NTRs.

Proposed annotations: MF: GO:0140707 chromatin-nuclear membrane anchor activity : changed this to GO:0140463 chromatin-protein adaptor activity BF: GO:0060820 inactivation of X chromosome by heterochromatin formation


No proposed MF annotation: Since the specific action is unknown, all we can do is annotate to the process (chromatin remodeling or a child).


Proposed annotations: MF: GO:0106260 DNA-DNA tethering activity BP: GO:0006325 chromatin organization


Proposed annotations: MF: GO:0140585 promoter-enhancer loop anchoring activity BP: GO:0140588 chromatin looping


Proposed annotations: GO:0030674 protein-macromolecule adaptor activity + appropriate inputs


Proposed annotations:



Proposed:


Proposed: GO:0140693 molecular condensate scaffold activity


Proposed: Since the mechanism is not known, cannot annotate MF


Proposed:


Proposed:


Proposed:


pgaudet commented 2 years ago

I've added this to the next miRNA call

hattrill commented 2 years ago

tagging @gantonazzo and @ValWood as well

RLovering commented 2 years ago

OK I have only just started trying to look at this and problably many of you have a better idea of what terms are needed, maybe I should stop trying to do the whole list and leave each of you to add a comment for each type. If I create a new conversation section for each set then everyone could edit the actual text? Or would it be better to do this in a google doc? I think there are 3 main areas covered in Fig 2 LncRNAs with a role in Chromatin organisation, LncRNAs with a role in transcription, LncRNAs with a role post-transcriptionally @colinlog

RLovering commented 2 years ago

LncRNAs with a role in Chromatin organisation

hattrill commented 2 years ago

That sounds like a good idea, then we can collect examples as we go.

do you mean to do this here or in google?

Here, for now, as we go back and forth, but when we get some soild direction, we should make an annotation doc.

RLovering commented 2 years ago

LncRNAs with a role in regulation of transcription

RLovering commented 2 years ago

LncRNAs with a role post-transcriptionally

hattrill commented 2 years ago

Just trying to list other ways in which we can distinguish activities - not mutually exclusive, but to help thinking about it all as from Ruth's list, their activities seem almost as diverse as protein's are....

RLovering commented 2 years ago

Helen this is a nice way to look at this too

colinlog commented 2 years ago

Interesting gene regulatory topic these LncRNAs. Glad that S.pombe also has the protein, makes me believe the whole more...

My take has been that each of these LncRNAs has its own story to tell and the whole is rather idiosyncratic. Xist, Hotair, SRA, etc.. are much studied RNAs, a lot of hot attention has gone to a polycomb H3K27methylation directed by associated RNAs theory, but again, not easy to model or understand or even demonstrate convincingly...

In addition there is also a ribosomal promoter DNA-RNA basepair phenomenon -- initially by Ingrid Grummt that cannot be dismissed as an artifact. For a genome-wide example of RNA-DNA triplex structures see PMID:30605520

Still, to my knowledge no-one could biochemically reconstitute what they do, what the molecular activities really are? The sponge activities are perhaps the best documented? Furthermore there are 'regulators of the regulators'. For instance, Xist has an antagonist TSIX...

Ultimately, there are proteins involved, so those may bear the specific biochemical-molecular function (bind RNA and then do ...), whilst the RNAs have more of an adaptor-target specifying role or a phase transition initiation/maintenance role?

colinlog commented 2 years ago

OK I have only just started trying to look at this and problably many of you have a better idea of what terms are needed, maybe I should stop trying to do the whole list and leave each of you to add a comment for each type. If I create a new conversation section for each set then everyone could edit the actual text? Or would it be better to do this in a google doc? I think there are 3 main areas covered in Fig 2 LncRNAs with a role in Chromatin organisation, LncRNAs with a role in transcription, LncRNAs with a role post-transcriptionally @colinlog

Yes @RLovering , I see much sense in your proposal for organizing the process regulation aspects by LncRNAs in terms of (i) regulation - induction of chromatin organisation / histone or DNA modification (ii) regulation of transcription with of course iia initiation, iib elongation iic termination of transcription and (iii) the post-transcriptional pathways, that modulate iiia RNA splicing, iiib RNA stability, iiic RNA sub-nuclear localisation export/re-import or iiid translation by ribosomes. I think then that the STAT3 example would be a (iia) type regulation. Xist a (i) type, and teh Lnc-p21 example iiid. The trick being to have product oriented definitions and logic for the BPs and RNA molecule or RNA binding protein activity-centered definitions for the MFs, so . Most of this is already in place for proteins, isn't it?

@hattrill The cell components, are where it happens (Cell component or complex of multiple proteins/RNAs/DNAs), rather than activities? "This MF for that BP in this CC" and in GO-CAMS this RNA MF has + or - effect on the next bioentity's MF

pgaudet commented 2 years ago

+id: GO:0140870 +name: RNA polymerase inhibitor activity +namespace: molecular_function +def: "Binds to and stops, prevents or reduces the activity of RNA polymerase." [PMID:15300239, PMID:18313387, PMID:31048766] +is_a: GO:0004857 ! enzyme inhibitor activity +property_value: term_tracker_item https://github.com/geneontology/go-ontology/issues/23028 xsd:anyURI +created_by: pg +creation_date: 2022-07-04T14:26:28Z + +[Term] +id: GO:0140871 +name: repressor of RNA polymerase inhibitor activity +namespace: molecular_function +def: "Binds to and stops, prevents or reduces the activity of an RNA polymerase inhibitor." [PMID:28475895, PMID:31048766] +is_a: GO:0004857 ! enzyme inhibitor activity +property_value: term_tracker_item https://github.com/geneontology/go-ontology/issues/23028 xsd:anyURI +created_by: pg +creation_date: 2022-07-04T14:26:28Z

sjm41 commented 2 years ago

@pgaudet For "repressor of RNA polymerase inhibitor activity", should the parent be changed from "enzyme inhibitor activity" to "enzyme activator activity", given the activity is repressing an inhibitor of an RNApol?

pgaudet commented 2 years ago

I don't think it matches the definition of Molecular Function regulator: "A molecular function regulator that activates or increases the activity of its target via non-covalent binding that does not result in covalent modification to the target."

What do you think?

@vanaukenk @ukemi

colinlog commented 2 years ago

I would go for a regulator of activity annotation for some Lnc (if convincingly demonstrated) but not inhibitor of enzyme activity when RNA polymerase II is concerned since there are many functions that can lead to less or more transcription of a gene (rate/frequency of intiation, processivity of elongation, etc, etc.) The exception would be if the active site of the polymerase is contacted. So, global BP annotation to transcription regulator [GO:0006355 regulation of DNA-templated transcription] if that is demonstrated. Possibly with a DNA cis-acting element binding has_part [GO:0000978] RNA polymerase II cis-regulatory region sequence-specific DNA binding], if that is demonstrated. Often I would imagine that an annotation to an epigenetic process + (positive regulator eg; [GO:0045815 biological_process epigenetic maintenance of chromatin in transcription-competent conformation] or - (negative regulator eg; [GO:0045814 biological_process negative regulation of gene expression, epigenetic] or its descendant term (eg [GO:0031507 biological_process heterochromatin formation], or one of it's more defined descendant terms can be made if more is known.