Open ukemi opened 6 years ago
I think we need to start curating the location explicitly in Rhea when we have that information, and replace the "in" and "out" (conventions left over from IUBMB) with actual GO terms for subcellular locations. Actually IUBMB use a range of terms now like side 1 and side 2 which is creating even more trouble.
This would give us a reaction hierarchy that would be much easier for users to understand and exploit.
and "child" terms like
A (cytosol) = A (extracellular space) [crossing a membrane...]
A (cytosolic leaflet of plasma membrane) = A (extracellular leaflet of plasma membrane) [flipping within a membrane]
A (cytosolic leaflet of plasma membrane) = A (cytosolic leaflet of golgi membrane) [moving from one membrane to another, but staying on the cytosolic face]
(apologies for not using real GO terms here for the sake of speed)
the transport hierarchy could then be inferred based on i) ChEBI classes and ii) GO cell components:
Example of a Rhea transporter ID:
https://www.rhea-db.org/reaction?id=20621
Equivalent GO class:
id: GO:0016463
name: zinc-exporting ATPase activity
namespace: molecular_function
def: "Enables the transfer of a solute or solutes from one side of a membrane to the other according to the reaction: ATP + H2O + Zn2+(in) -> ADP + phosphate + Zn2+(out)." [EC:3.6.3.5]
synonym: "ATP phosphohydrolase (Zn2+-exporting)" EXACT [EC:3.6.3.5]
synonym: "AtHMA4" RELATED [EC:3.6.3.5]
synonym: "P(1B)-type ATPase activity" NARROW [EC:3.6.3.5]
synonym: "P1B-type ATPase activity" EXACT [EC:3.6.3.5]
synonym: "Zn(2+)-exporting ATPase activity" EXACT [EC:3.6.3.5]
synonym: "Zn(II)-translocating P-type ATPase activity" EXACT [EC:3.6.3.5]
synonym: "Zn2+-exporting ATPase activity" EXACT [EC:3.6.3.5]
synonym: "zinc exporting ATPase activity" EXACT []
synonym: "zinc-translocating P-type ATPase activity" BROAD [EC:3.6.3.5]
xref: EC:3.6.3.5
xref: MetaCyc:3.6.3.5-RXN
xref: RHEA:20621
is_a: GO:0005385 ! zinc ion transmembrane transporter activity
is_a: GO:0015662 ! ATPase activity, coupled to transmembrane movement of ions, phosphorylative mechanism
is_a: GO:0019829 ! cation-transporting ATPase activity
Another, this one where we have a logical def on the equivalent GO:
https://www.rhea-db.org/reaction?id=11348
id: GO:0015446
name: ATPase-coupled arsenite transmembrane transporter activity
namespace: molecular_function
def: "Enables the transfer of a solute or solutes from one side of a membrane to the other according to the reaction: ATP + H2O + arsenite(in) = ADP + phosphate + arsenite(out)." [EC:3.6.3.16]
synonym: "ATP phosphohydrolase (arsenite-exporting) activity" EXACT [EC:3.6.3.16]
synonym: "arsenical pump-driving ATPase activity" EXACT [EC:3.6.3.16]
synonym: "arsenical resistance ATPase activity" EXACT [EC:3.6.3.16]
synonym: "arsenical resistance efflux pump" EXACT []
synonym: "arsenite ABC transporter" NARROW []
synonym: "arsenite transporting ATPase activity" EXACT []
synonym: "arsenite-translocating ATPase activity" EXACT [EC:3.6.3.16]
synonym: "arsenite-transmembrane transporting ATPase activity" RELATED []
synonym: "arsenite-transporting ATPase activity" EXACT [EC:3.6.3.16, GOC:vw]
xref: EC:3.6.3.16
xref: MetaCyc:3.6.3.16-RXN
xref: RHEA:11348
is_a: GO:0008490 ! arsenite secondary active transmembrane transporter activity
is_a: GO:0019829 ! cation-transporting ATPase activity
is_a: GO:0043225 ! ATPase-coupled anion transmembrane transporter activity
intersection_of: GO:0042626 ! ATPase activity, coupled to transmembrane movement of substances
intersection_of: transports_or_maintains_localization_of CHEBI:22629
for https://github.com/geneontology/go-ontology/issues/15975#issuecomment-445863868 corresponding GO MF: GO:0016463 zinc-exporting ATPase activity corresponding GO BP: GO:0071577 zinc ion transmembrane transport
@ukemi please check if it can be closed
Right now, there is no OWL representation in the axioms I have generated for the "in" and "out" you see in the RHEA transport reactions. The definition for A (in) = A (out) is the same as it is for A = A. I suspect this is something we could improve on with some more thought.
A related issue came up before for the translocation reactions coming into Noctua from Reactome, many of which are currently not classified in GO. The idea there was to use the occurs_in information in Reactome to generate 'has target start/end location' statements on the reaction entities that would then work with existing axioms to get classifications into biological process terms. E.g.,
R instance_of ‘establishment of protein localization’ R has_target_end_location location2 R has_target_start_location location1 <= R has output P2, P2 occurs_in location2, R has input P1 P1 occurs_in location1 P1 = P2 location1 != location2
Then the OWL reasoner should add more specific biological process - e.g. R instance_of ‘establishment of protein localization to mitochondrial membrane’ Based on its definition: 'establishment of protein localization' and ('has target end location' some 'mitochondrial membrane')
I think the transporter / translocation area could be a really nice result from the rhea/reactome/go integration. We get the more specific information about locations from Reactome, the class structure with extant OWL definitions from GO, and the expanded set of transport reactions from rhea. I think worthy of another ticket @ukemi .