Open pgaudet opened 5 years ago
I found this page: http://wiki.geneontology.org/index.php/Annotation_guidelines_for_annotating_complexes_as_annotation_objects
Should we update this?
Sure, please do ! It dates form 2015.
New annotation rules and annotation reviews required:
More details in mtg minutes:
General notes:
General actions:
1. Guidelines for annotation: GP1 part_of complex:
2. [GP1] | “x complex binding”
Usage of GO:0044877 protein-containing complex binding (as of 18/4/19) – incl all children:
Annotation review:
3. colocalizes_with:
We decided that this qualifier should no longer be used with protein-containing complex
[ ] accept New Rule GORULE:0000035 for: "Colocalizes_with qualifier not allowed with protein-containing complex (GO:0032991) and children."
Usage of colocalizes_with GO:0032991 protein-containing complex (as of 18/4/19) – incl children:
Annotation review:
PROTEIN-CONTAINING COMPLEX BINDING:
Usage of GO:0044877 protein-containing complex binding (as of 18/4/19) – exact term:
4. IPI: GP1-Cpx1 or Cpx2-Cpx3: If we know GP1 binds Cpx1 or Cpx2 binds Cpx3 (e.g. complex is identified as a whole or binding is via a composite binding site):
GP1 | protein-containing complex binding | IPI | with/from "CP AC for Cpx1" has_direct_input "CP AC of Cpx1" or CP AC of Cpx2 | protein-containing complex binding | IPI | CP AC of Cpx3 has_direct_input " CP AC of Cpx3"
The complex AC goes in the AE because the AE extends the term and contains the physiological partner of the GP (AE is optional but longterm should be systematically added).
[ ] New Rule 1: When annotating a GP or complex to protein-containing complex binding by IPI the CP AC of the complex that is being bound to MUST be added in the with/from field. Optionally, the CP AC can additionally be added to the AE with qualifier has_direct_input.
5. IPI: GP1-[GP2 part_of Cpx1]: If we know GP1 binds GP2 where GP2 is part of Cpx1 (e.g. complex component(s) is/are identified or binary link has been shown but complex evidence is also in the experiment):
GP1 | protein-containing complex binding | IPI | with/from "UniProt/MOD AC of GP2 " has_direct_input "CP AC of Cpx1"
with/from could also contain (a list of) binding partners from various sources, incl. ChEBI or CP ACs if binding a subcomplex.
with/from should be as specific to the immediate binding partner(s) as possible.
The complex AC goes in the AE because the AE extends the term and contains the physiological partner of the GP (AE is optional but longterm should be systematically added).
[ ] New Rule 2: If a GP binds to another GP (GP2 or a list of GPs) that has been identified as part of a complex, the UniProt/MOD/ChEBI AC/CP AC of GP2 (or the list) must be entered in the with/from field. GP2(s) must also be directly annotated to Cpx1 (see New Rule above). Optionally, the CP AC can additionally be added to the AE with qualifier has_direct_input.
Annotation review (pts 4 & 5): 368 IPI annotations, only 20 with AEs:
[ ] update GO complex terms in with/from and AE to CP ACs (other types of AEs will remain as is).
[ ] add CP ACs in AE where missing. (too much work?)
[ ] review 1 annotation (without AE) between 2 complexes: curiously it’s between the same complex (BHF-UCL).
[ ] review if the binding partners (GP2s) are directly annotated to the known complex and add annotations if missing.
[ ] @pgaudet to raise review ticket
Note: for the 20 annotations with AEs, with/from either contains UniProt or CP AC; the others have a variety of ACs
Note: AEs have a mix of has_(direct)_input “isoform AC/GO complex term/CP AC”, occurs_in “cell type or GO CC[location]”, part_of “GO BP”, happens_during “GO BP”)
6. ISS/ISO:
GP1(sp2) | protein-containing complex binding | ISS/ISO | with/from "GP1(sp1)” has_direct_input "CP AC Cpx1(sp2)" or Cpx2(sp2) | protein-containing complex binding | ISS/ISO | with/from "Cpx2(sp1)” has_direct_input "CP AC Cpx3(sp2)"
The complex AC in the annotation extension relates to the annotation object as GP1(sp2)/Cpx2(sp2) is the one binding Cpx1(sp2)/Cpx3(sp2).
The complex AC goes in the AE because the AE extends the term and contains the physiological partner of the GP [AE is optional but longterm should be systematically added].
[ ] New Rule 3: When inferring protein-containing complex binding by ISS/ISO the CP AC of the complex that is being bound to MUST be added in the annotation extension. The species of the annotated GP/Cpx and its complex binding partner must be the same.
Annotation review: 177 ISS annotations (42 with AEs) and 426 ISO annotations (no AEs):
Note:
Other evidence codes used but not yet discussed:
7. 291 IDA annotations:
If IDA use is valid:
Annotation review:
8. 13 IMP annotations:
Annotation review:
9. 1 IC annotation:
Annotation review:
10. 1 TAS annotation:
Annotation review:
I hope that's all. Please discuss!
Birgit
Noting from conversation with @pgaudet and @dougli1sqrd , we are currently holding on further action here.
From the 2019-04 GOC meeting in Cambridge, these are new proposed rules for annotation to Protein Complex binding: (NOTE: CP : Complex portal)
@bmeldal and @pgaudet to formalize these rules