RTXteam / RTX-KG2

Build system for the RTX-KG2 biomedical knowledge graph, part of the ARAX reasoning system (https://github.com/RTXTeam/RTX)
MIT License
34 stars 9 forks source link

Bazillion Cardiolipin Biosynthesis pathways #361

Open dkoslicki opened 5 months ago

dkoslicki commented 5 months ago

I noticed that there are a ton of pathways with names like:

Cardiolipin Biosynthesis CL(i-13:0/i-16:0/i-17:0/a-15:0),Disease
Cardiolipin Biosynthesis CL(a-13:0/i-22:0/i-13:0/i-20:0),Disease
Cardiolipin Biosynthesis CL(a-13:0/i-15:0/18:2(9Z,11Z)/a-21:0)
Cardiolipin Biosynthesis CL(a-13:0/a-25:0/i-22:0/i-19:0),Disease
Cardiolipin Biosynthesis CL(22:5(7Z,10Z,13Z,16Z,19Z)/22:5(4Z,7Z,10Z,13Z,16Z)/22:5(7Z,10Z,13Z,16Z,19Z)/22:5(4Z,7Z,10Z,13Z,16Z))
Cardiolipin Biosynthesis CL(a-13:0/i-13:0/i-17:0/i-18:0),Disease
Cardiolipin Biosynthesis CL(i-12:0/i-12:0/a-25:0/i-22:0),Disease
Cardiolipin Biosynthesis CL(i-13:0/i-17:0/a-25:0/i-24:0),Disease
Cardiolipin Biosynthesis CL(18:1(11Z)/16:1(9Z)/18:1(9Z)/22:5(7Z,10Z,13Z,16Z,19Z))
Cardiolipin Biosynthesis CL(i-12:0/a-13:0/a-17:0/i-19:0),Disease

Note: each of these is resolved by our synonym lookup, in case you want CURIES (like SMPDB:SMP0058385). Two things stick out to me:

  1. That Disease is appended and
  2. That it looks like the CL... stuff is a metabolite/chemical name appended to the end of it

Marking as sar-look as I don't know how he wants to triage this one

dkoslicki commented 5 months ago

And by bazillions, I mean at least 23,279 such examples in KG2.8.4