CompositionalDietaryNutritionOntology / cdno

Compositional Dietary Nutrition Ontology
Creative Commons Zero v1.0 Universal
11 stars 3 forks source link

Multiple labels for ChEBI chemicals #109

Closed ddooley closed 5 months ago

ddooley commented 1 year ago

FoodOn imports ChEBI and CDNO terms, and sometimes a chemical gets imported twice. I noticed with glycerol:

image

The middle one is ChEBI's term, the other two originate in CDNO. Can CDNO drop its hardcoded labels in favour of a ChEBI import file of the same (i.e. an Ontofox or other fetch of CHEBI content).

This came up with attempt to use robot to convert FoodOn to obo format that errors on multiple labels for a single entity.

ddooley commented 1 year ago

I think the CHEBI items in question probably are the following ones. This list of duplicates is from a query at the FoodOn side. By just relying on CHEBI names, and having any kind of label CDNO wants indicated by an "alternative term" annotation, we would solve the robot convert to obo format generation error.

<http://purl.obolibrary.org/obo/CHEBI_76132>    "(+)-catechin 3-O-gallate"@en   "(+)-catechin-3-O-gallate"
<http://purl.obolibrary.org/obo/CHEBI_9577> "(-)-alpha-thujone" "(−)-α-thujone"@en
<http://purl.obolibrary.org/obo/CHEBI_4806> "(-)-epigallocatechin 3-gallate"    "(−)-epigallocatechin 3-gallate"@en
<http://purl.obolibrary.org/obo/CHEBI_67244>    "(-)-lariciresinol" "(−)-lariciresinol"@en
<http://purl.obolibrary.org/obo/CHEBI_108>  "(-)-phaseolin" "(−)-phaseolin"@en
<http://purl.obolibrary.org/obo/CHEBI_77366>    "(6Z,9Z,12Z,15Z,18Z,21Z)-tetracosahexaenoic acid"   "6Z,9Z,12Z,15Z,18Z,21Z-tetracosahexaenoic acid"@en
<http://purl.obolibrary.org/obo/CHEBI_132141>   "1,2-naphthoquinones"   "1,2-napthoquinone"@en
<http://purl.obolibrary.org/obo/CHEBI_132142>   "1,4-naphthoquinones"   "1,4-napthoquinone"@en
<http://purl.obolibrary.org/obo/CHEBI_174910>   "4'-O-methyl-(-)-epicatechin"   "4'-O-methyl-(−)-epicatechin"@en
<http://purl.obolibrary.org/obo/CHEBI_140677>   "4-hydroxy-all-trans-beta-carotene" "4-hydroxy-all-trans-β-carotene"@en
<http://purl.obolibrary.org/obo/CHEBI_1904> "4-methylumbelliferone beta-D-glucuronide"  "4-methylumbelliferone β-D-glucuronide"@en
<http://purl.obolibrary.org/obo/CHEBI_18394>    "6-O-alpha-D-glucopyranosyl-D-fructofuranose"   "6-O-α-D-glucopyranosyl-D-fructofuranose"@en
<http://purl.obolibrary.org/obo/CHEBI_67188>    "9-cis-beta-carotene"   "9-cis-β-carotene"@en
<http://purl.obolibrary.org/obo/CHEBI_31342>    "Calcium hypochlorite"  "calcium hypochlorite"@en
<http://purl.obolibrary.org/obo/CHEBI_3815> "Collagen"  "collagen"@en
<http://purl.obolibrary.org/obo/CHEBI_4767> "Elastin"   "elastin"@en
<http://purl.obolibrary.org/obo/CHEBI_5054> "Fibrin"    "fibrin"@en
<http://purl.obolibrary.org/obo/CHEBI_141540>   "Myricanone"    "myricanone"@en
<http://purl.obolibrary.org/obo/CHEBI_167695>   "Punicalagin"   "punicalagin"@en
<http://purl.obolibrary.org/obo/CHEBI_28425>    "alpha-carotene"    "α-carotene"@en
<http://purl.obolibrary.org/obo/CHEBI_10223>    "alpha-cryptoxanthin"   "α-cryptoxanthin"@en
<http://purl.obolibrary.org/obo/CHEBI_27432>    "alpha-linolenic acid"  "α-linolenic acid"@en
<http://purl.obolibrary.org/obo/CHEBI_22470>    "alpha-tocopherol"  "α-tocopherol"@en
<http://purl.obolibrary.org/obo/CHEBI_33270>    "alpha-tocotrienol" "α-tocotrienol"@en
<http://purl.obolibrary.org/obo/CHEBI_83820>    "amino acid derived from non-proteinacious"@en  "non-proteinogenic amino acid"
<http://purl.obolibrary.org/obo/CHEBI_62967>    "amorphous cellulose"   "cellulose"@en
<http://purl.obolibrary.org/obo/CHEBI_30248>    "beta-D-Gal-(1->3)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc"    "β-D-Gal-(1→3)-β-D-GlcNAc-(1→3)-β-D-Gal-(1→4)-D-Glc"@en
<http://purl.obolibrary.org/obo/CHEBI_22798>    "beta-D-glucoside"  "β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_17579>    "beta-carotene" "β-carotene"@en
<http://purl.obolibrary.org/obo/CHEBI_10362>    "beta-cryptoxanthin"    "β-cryptoxanthin"@en
<http://purl.obolibrary.org/obo/CHEBI_47771>    "beta-tocopherol"   "β-tocopherol"@en
<http://purl.obolibrary.org/obo/CHEBI_33275>    "beta-tocotrienol"  "β-tocotrienol"@en
<http://purl.obolibrary.org/obo/CHEBI_23086>    "chalcone"@en   "chalcones"
<http://purl.obolibrary.org/obo/CHEBI_27475>    "cyanidin 3-O-beta-D-galactoside"   "cyanidin 3-O-β-D-galactoside"@en
<http://purl.obolibrary.org/obo/CHEBI_37664>    "cyanidin 3-O-beta-D-galactoside chloride"  "cyanidin 3-O-β-D-galactoside chloride"@en
<http://purl.obolibrary.org/obo/CHEBI_74811>    "cyanidin 3-O-beta-D-sambubioside"  "cyanidin 3-O-β-D-sambubioside"@en
<http://purl.obolibrary.org/obo/CHEBI_42202>    "daidzein 7-O-beta-D-glucoside" "daidzein 7-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_31463>    "delphinidin 3-O-beta-D-glucoside"  "delphinidin 3-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_47772>    "delta-tocopherol"  "δ-tocopherol"@en
<http://purl.obolibrary.org/obo/CHEBI_33276>    "delta-tocotrienol" "δ-tocotrienol"@en
<http://purl.obolibrary.org/obo/CHEBI_71230>    "dihydrochalcone"@en    "dihydrochalcones"
<http://purl.obolibrary.org/obo/CHEBI_48039>    "dihydroflavanol"@en    "dihydroflavonols"
<http://purl.obolibrary.org/obo/CHEBI_28661>    "gamma-linolenic acid"  "γ-linolenic acid"@en
<http://purl.obolibrary.org/obo/CHEBI_18185>    "gamma-tocopherol"  "γ-tocopherol"@en
<http://purl.obolibrary.org/obo/CHEBI_33277>    "gamma-tocotrienol" "γ-tocotrienol"@en
<http://purl.obolibrary.org/obo/CHEBI_74224>    "genistein (1-)"@en "genistein(1-)"
<http://purl.obolibrary.org/obo/CHEBI_27514>    "genistein 7-O-beta-D-glucoside"    "genistein 7-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_133348>   "glycitein 7-(6-O-acetyl-beta-D-glucoside)" "glycitein 7-(6-O-acetyl-β-D-glucoside)"@en
<http://purl.obolibrary.org/obo/CHEBI_18305>    "hydroquinone O-beta-D-glucopyranoside" "hydroquinone O-β-D-glucopyranoside"@en
<http://purl.obolibrary.org/obo/CHEBI_16382>    "iodide"    "iodine"@en
<http://purl.obolibrary.org/obo/CHEBI_37168>    "iota-carrageenan"  "ι-carrageenan"@en
<http://purl.obolibrary.org/obo/CHEBI_75832>    "iron sulfate"@en   "iron(2+) sulfate (anhydrous)"
<http://purl.obolibrary.org/obo/CHEBI_75832>    "iron sulfate"@en   "iron(2+) sulfate (anhydrous)"@en
<http://purl.obolibrary.org/obo/CHEBI_31742>    "kaempferol 3-O-beta-D-galactoside" "kaempferol 3-O-β-D-galactoside"@en
<http://purl.obolibrary.org/obo/CHEBI_31743>    "kaempferol 3-O-beta-D-glucosyl-(1->2)-beta-D-glucoside"    "kaempferol 3-O-β-D-glucosyl-(1→2)-β-D-glucosyl-(1→2)-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_10583>    "kappa-carrageenan" "κ-carrageenan"@en
<http://purl.obolibrary.org/obo/CHEBI_37167>    "lambda-carrageenan"    "λ-carrageenan"@en
<http://purl.obolibrary.org/obo/CHEBI_6579> "luteolin 7-O-(6-O-malonyl-beta-D-glucoside)"   "luteolin 7-O-(6-O-malonyl-β-D-glucoside)"@en
<http://purl.obolibrary.org/obo/CHEBI_77791>    "luteolin 7-O-beta-D-glucoside(1-)" "luteolin 7-O-β-D-glucoside(1−)"@en
<http://purl.obolibrary.org/obo/CHEBI_31799>    "malvidin 3-O-beta-D-glucoside" "malvidin 3-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_18291>    "manganese atom"    "manganese(2+)"@en
<http://purl.obolibrary.org/obo/CHEBI_25481>    "naphthoquinone"    "napthoquinone"@en
<http://purl.obolibrary.org/obo/CHEBI_28327>    "naringenin 7-O-beta-D-glucoside"   "naringenin 7-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_25681>    "omega-3 fatty acid"    "ω−3 fatty acid"@en
<http://purl.obolibrary.org/obo/CHEBI_36009>    "omega-6 fatty acid"    "ω−6 fatty acid"@en
<http://purl.obolibrary.org/obo/CHEBI_25704>    "organic sulfate"   "organic sulfate salt"@en
<http://purl.obolibrary.org/obo/CHEBI_36122>    "pelargonidin 3-O-beta-D-glucoside chloride"    "pelargonidin 3-O-β-D-glucoside chloride"@en
<http://purl.obolibrary.org/obo/CHEBI_74793>    "peonidin 3-O-beta-D-glucoside" "peonidin 3-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_31985>    "petunidin 3-O-beta-D-glucoside"    "petunidin 3-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_33468>    "phosphide"@en  "phosphide(3-)"
<http://purl.obolibrary.org/obo/CHEBI_26125>    "phytosterol"@en    "phytosterols"
<http://purl.obolibrary.org/obo/CHEBI_190301>   "potassium phosphate"@en    "tripotassium phosphate"
<http://purl.obolibrary.org/obo/CHEBI_8454> "progoitrin(1-)"    "progoitrin(1−)"@en
<http://purl.obolibrary.org/obo/CHEBI_131498>   "quercetin 3,4'-di-O-beta-D-glucoside"  "quercetin 3,4'-di-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_67486>    "quercetin 3-O-beta-D-galactopyranoside"    "quercetin 3-O-β-D-galactopyranoside"@en
<http://purl.obolibrary.org/obo/CHEBI_68352>    "quercetin 3-O-beta-D-glucopyranoside"  "quercetin 3-O-β-D-glucopyranoside"@en
<http://purl.obolibrary.org/obo/CHEBI_32082>    "quercetin 3-O-beta-D-glucosyl-(1->2)-beta-D-glucoside" "quercetin 3-O-β-D-glucosyl-(1→2)-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_75839>    "quercetin 4'-O-beta-D-glucopyranoside" "quercetin 4ʼ-O-β-D-glucopyranoside"@en
<http://purl.obolibrary.org/obo/CHEBI_131499>   "quercetin 7,4'-di-O-beta-D-glucoside"  "quercetin 7,4'-di-O-β-D-glucoside"@en
<http://purl.obolibrary.org/obo/CHEBI_145013>   "rebaudioside B (1-)"@en    "rebaudioside B(1-)"
<http://purl.obolibrary.org/obo/CHEBI_26658>    "sesquiterpenoid"   "sesquiterpoid"@en
<http://purl.obolibrary.org/obo/CHEBI_15138>    "sulfide(2-)"   "sulfide(2−)"@en
LilyAndres commented 1 year ago

Thanks @ddooley, apologies for my late reply. I was away.

I have checked the terms and indeed some of them have a different label than what it's described in the original cheri labels. However, some of them had the same label as chebi and I'm not sure why they were highlighted as duplicated when you ran the code in FoodOn. I have fixed all of them but it would be great if you can run the command again and let me know if there are any issues.

I'm sure we have the CHEBI terms as the ones that we want to use and we added "ExactSynonym" to describe alternative names...

Screen Shot 2022-11-16 at 5 50 18 pm
ddooley commented 1 year ago

The problem is that you have AL rdfs:label@en at column top? CHEBI uses ^^xml:string datatype instead for their labels. That might fix duplication problem. OBO format doesn't like for a term to have both rdfs:label@en and rdfs:label^^xml:string .

LilyAndres commented 1 year ago

Thanks @ddooley. I have changed the label datatype in the 'nutritional_components_framework.csv' template to AT rdfs:label^^xml:string:

Screen Shot 2022-11-17 at 12 14 27 pm

Is looking something like this:

Screen Shot 2022-11-17 at 11 53 27 am

I hope is ok, please let me know if not and I will make the corresponding changes. Thanks a lot.

ddooley commented 1 year ago

So first, I made a wee mistake - for ChEBI it should be xsd:string, not xml:string. Sorry about that!

Now CDNO still has some duplicates where you've entered a new character, like for alpha-carotene? we actually have to get to the place where every term has exactly 1 label. From OBO principles:

use rdfs:label for the primary label
include exactly one rdfs:label for every declared entity (e.g. class, property)

So in CDNO e.g. this has to be fixed:

image

the label should match ChEBI's alpha-carotene^^xsd:string label exactly.

alpha-carotene And the **α-carotene** label must be switched to say "alternative term" IAO_0000118 annotation, or a synonym.
ddooley commented 1 year ago

This is the list of label duplicates, requeried.

http://purl.obolibrary.org/obo/CHEBI_81377 "(+)-nootkatone" http://purl.obolibrary.org/obo/CHEBI_17948 "(+)-taxifolin" http://purl.obolibrary.org/obo/CHEBI_123 "(-)-Variabilin" http://purl.obolibrary.org/obo/CHEBI_90236 "(2E,6E)-farnesyl monophosphate" http://purl.obolibrary.org/obo/CHEBI_41941 "(3,4-dihydroxyphenyl)acetic acid" http://purl.obolibrary.org/obo/CHEBI_15396 "(R)-camphor" http://purl.obolibrary.org/obo/CHEBI_50371 "(R)-rosmarinic acid" http://purl.obolibrary.org/obo/CHEBI_17846 "(S)-naringenin" http://purl.obolibrary.org/obo/CHEBI_4806 "(-)-epigallocatechin 3-gallate" http://purl.obolibrary.org/obo/CHEBI_67244 "(-)-lariciresinol" http://purl.obolibrary.org/obo/CHEBI_108 "(-)-phaseolin" http://purl.obolibrary.org/obo/CHEBI_9577 "(-)-alpha-thujone" http://purl.obolibrary.org/obo/CHEBI_27961 "1,8-cineole" http://purl.obolibrary.org/obo/CHEBI_27479 "2'-hydroxydaidzein" http://purl.obolibrary.org/obo/CHEBI_17678 "2'-hydroxyformononetin" http://purl.obolibrary.org/obo/CHEBI_18026 "2,3-dihydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_165217 "2,4,6-Trihydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_17189 "2,5-dihydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_68465 "2,6-dihydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_17407 "2-trans,6-trans-farnesyl diphosphate" http://purl.obolibrary.org/obo/CHEBI_50399 "3',4',7-trihydroxyisoflavone" http://purl.obolibrary.org/obo/CHEBI_59979 "3',5'-di-O-methyltricetin" http://purl.obolibrary.org/obo/CHEBI_36062 "3,4-dihydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_1387 "3,4-dihydroxyphenylethyleneglycol" http://purl.obolibrary.org/obo/CHEBI_28647 "3-O-methylgallic acid" http://purl.obolibrary.org/obo/CHEBI_63798 "3-hydroxy-4-methoxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_30764 "3-hydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_18404 "3-methylcatechol" http://purl.obolibrary.org/obo/CHEBI_28631 "3-phenylpropionic acid" http://purl.obolibrary.org/obo/CHEBI_16514 "4',5,7-trihydroxy-3'-methoxyflavone" http://purl.obolibrary.org/obo/CHEBI_174910 "4'-O-methyl-(-)-epicatechin" http://purl.obolibrary.org/obo/CHEBI_140677 "4-hydroxy-all-trans-beta-carotene" http://purl.obolibrary.org/obo/CHEBI_30763 "4-hydroxybenzoic acid" http://purl.obolibrary.org/obo/CHEBI_17254 "4-methylcatechol" http://purl.obolibrary.org/obo/CHEBI_1904 "4-methylumbelliferone beta-D-glucuronide" http://purl.obolibrary.org/obo/CHEBI_18394 "6-O-alpha-D-glucopyranosyl-D-fructofuranose" http://purl.obolibrary.org/obo/CHEBI_28184 "7-ethoxycoumarin" http://purl.obolibrary.org/obo/CHEBI_67188 "9-cis-beta-carotene" http://purl.obolibrary.org/obo/CHEBI_2961 "Azukisaponin III" http://purl.obolibrary.org/obo/CHEBI_3429 "Carnosol" http://purl.obolibrary.org/obo/CHEBI_3461 "Casuarictin" http://purl.obolibrary.org/obo/CHEBI_3462 "Casuarinin" http://purl.obolibrary.org/obo/CHEBI_3584 "Chebulinic acid" http://purl.obolibrary.org/obo/CHEBI_3942 "Cucurbitacin C" http://purl.obolibrary.org/obo/CHEBI_4019 "Cyclokievitone" http://purl.obolibrary.org/obo/CHEBI_4308 "Dalbergin" http://purl.obolibrary.org/obo/CHEBI_5087 "Flavoxanthin" http://purl.obolibrary.org/obo/CHEBI_5180 "Fucofuroeckol B" http://purl.obolibrary.org/obo/CHEBI_5328 "Geraniin" http://purl.obolibrary.org/obo/CHEBI_5920 "Ineketone" http://purl.obolibrary.org/obo/CHEBI_5972 "Irisolidone" http://purl.obolibrary.org/obo/CHEBI_5985 "Isobetanin" http://purl.obolibrary.org/obo/CHEBI_7037 "Mutatochrome" http://purl.obolibrary.org/obo/CHEBI_7623 "Norbixin" http://purl.obolibrary.org/obo/CHEBI_7713 "Obacunone" http://purl.obolibrary.org/obo/CHEBI_7948 "Pedunculagin" http://purl.obolibrary.org/obo/CHEBI_8046 "Phaseoloside D" http://purl.obolibrary.org/obo/CHEBI_8183 "Physalien" http://purl.obolibrary.org/obo/CHEBI_8186 "Physoperuvine" http://purl.obolibrary.org/obo/CHEBI_8367 "Prebetanin" http://purl.obolibrary.org/obo/CHEBI_167696 "Punicalin" http://purl.obolibrary.org/obo/CHEBI_8870 "Rishitin" http://purl.obolibrary.org/obo/CHEBI_9045 "Sayanedine" http://purl.obolibrary.org/obo/CHEBI_9048 "Schisantherin A" http://purl.obolibrary.org/obo/CHEBI_9151 "Sinalbin" http://purl.obolibrary.org/obo/CHEBI_9228 "Spinasaponin A" http://purl.obolibrary.org/obo/CHEBI_9433 "Tellimagrandin I" http://purl.obolibrary.org/obo/CHEBI_9454 "Terminalin" http://purl.obolibrary.org/obo/CHEBI_9517 "Theasaponin" http://purl.obolibrary.org/obo/CHEBI_9647 "Trachelogenin" http://purl.obolibrary.org/obo/CHEBI_80441 "Tulipanin" http://purl.obolibrary.org/obo/CHEBI_10086 "Yamogenin" http://purl.obolibrary.org/obo/CHEBI_2366 "absinthin" http://purl.obolibrary.org/obo/CHEBI_581177 "agrimoniin" http://purl.obolibrary.org/obo/CHEBI_32446 "all-trans-neoxanthin" http://purl.obolibrary.org/obo/CHEBI_8191 "all-trans-phytoene" http://purl.obolibrary.org/obo/CHEBI_17898 "all-trans-retinal" http://purl.obolibrary.org/obo/CHEBI_35288 "all-trans-violaxanthin" http://purl.obolibrary.org/obo/CHEBI_27867 "antheraxanthin" http://purl.obolibrary.org/obo/CHEBI_16778 "apigenin 7-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_40968 "astaxanthin" http://purl.obolibrary.org/obo/CHEBI_38200 "astilbin" http://purl.obolibrary.org/obo/CHEBI_27408 "avenacin A-1" http://purl.obolibrary.org/obo/CHEBI_27679 "betagarin" http://purl.obolibrary.org/obo/CHEBI_3087 "betulinic acid" http://purl.obolibrary.org/obo/CHEBI_17574 "biochanin A" http://purl.obolibrary.org/obo/CHEBI_28093 "borneol" http://purl.obolibrary.org/obo/CHEBI_17793 "calycosin" http://purl.obolibrary.org/obo/CHEBI_3362 "canthaxanthin" http://purl.obolibrary.org/obo/CHEBI_3375 "capsanthin" http://purl.obolibrary.org/obo/CHEBI_3378 "capsorubin" http://purl.obolibrary.org/obo/CHEBI_23042 "carotene" http://purl.obolibrary.org/obo/CHEBI_69304 "cinnamtannin B-1" http://purl.obolibrary.org/obo/CHEBI_17395 "cis-caffeic acid" http://purl.obolibrary.org/obo/CHEBI_18211 "citrulline" http://purl.obolibrary.org/obo/CHEBI_16547 "coniferyl aldehyde" http://purl.obolibrary.org/obo/CHEBI_28794 "coumarin" http://purl.obolibrary.org/obo/CHEBI_3918 "crocetin" http://purl.obolibrary.org/obo/CHEBI_27475 "cyanidin 3-O-beta-D-galactoside" http://purl.obolibrary.org/obo/CHEBI_37664 "cyanidin 3-O-beta-D-galactoside chloride" http://purl.obolibrary.org/obo/CHEBI_74811 "cyanidin 3-O-beta-D-sambubioside" http://purl.obolibrary.org/obo/CHEBI_3978 "cyanin" http://purl.obolibrary.org/obo/CHEBI_42202 "daidzein 7-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_31463 "delphinidin 3-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_38701 "delphinidin chloride" http://purl.obolibrary.org/obo/CHEBI_78802 "diarylheptanoid" http://purl.obolibrary.org/obo/CHEBI_4746 "echinenone" http://purl.obolibrary.org/obo/CHEBI_34741 "equol" http://purl.obolibrary.org/obo/CHEBI_67939 "erythrodiol" http://purl.obolibrary.org/obo/CHEBI_4853 "esculin" http://purl.obolibrary.org/obo/CHEBI_4932 "eupatilin" http://purl.obolibrary.org/obo/CHEBI_42567 "fisetin" http://purl.obolibrary.org/obo/CHEBI_38672 "flavans" http://purl.obolibrary.org/obo/CHEBI_28802 "flavonols" http://purl.obolibrary.org/obo/CHEBI_18088 "formononetin" http://purl.obolibrary.org/obo/CHEBI_5186 "fucoxanthin" http://purl.obolibrary.org/obo/CHEBI_5262 "galangin" http://purl.obolibrary.org/obo/CHEBI_27587 "garbanzol" http://purl.obolibrary.org/obo/CHEBI_27514 "genistein 7-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_74224 "genistein(1-)" http://purl.obolibrary.org/obo/CHEBI_17447 "geraniol" http://purl.obolibrary.org/obo/CHEBI_10136 "gingerol" http://purl.obolibrary.org/obo/CHEBI_77152 "ginsenoside Rb2" http://purl.obolibrary.org/obo/CHEBI_77151 "ginsenoside Rg2" http://purl.obolibrary.org/obo/CHEBI_5396 "glucoberteroin(1-)" http://purl.obolibrary.org/obo/CHEBI_64962 "glucobrassicin(1-)" http://purl.obolibrary.org/obo/CHEBI_5399 "glucocapparin(1-)" http://purl.obolibrary.org/obo/CHEBI_5402 "glucocochlearin(1-)" http://purl.obolibrary.org/obo/CHEBI_5411 "gluconapin(1-)" http://purl.obolibrary.org/obo/CHEBI_5414 "glucoputranjivin(1-)" http://purl.obolibrary.org/obo/CHEBI_5416 "glucoraphenin(1-)" http://purl.obolibrary.org/obo/CHEBI_24279 "glucosinolate" http://purl.obolibrary.org/obo/CHEBI_133348 "glycitein 7-(6-O-acetyl-beta-D-glucoside)" http://purl.obolibrary.org/obo/CHEBI_30853 "glycyrrhetinic acid" http://purl.obolibrary.org/obo/CHEBI_5623 "harman" http://purl.obolibrary.org/obo/CHEBI_5764 "hordenine" http://purl.obolibrary.org/obo/CHEBI_5769 "humulone" http://purl.obolibrary.org/obo/CHEBI_18305 "hydroquinone O-beta-D-glucopyranoside" http://purl.obolibrary.org/obo/CHEBI_24697 "hydroxyflavanone" http://purl.obolibrary.org/obo/CHEBI_24698 "hydroxyflavone" http://purl.obolibrary.org/obo/CHEBI_5970 "irilone" http://purl.obolibrary.org/obo/CHEBI_18224 "isoeugenol" http://purl.obolibrary.org/obo/CHEBI_310312 "isoliquiritigenin" http://purl.obolibrary.org/obo/CHEBI_17965 "isoorientin" http://purl.obolibrary.org/obo/CHEBI_28853 "isopimpinellin" http://purl.obolibrary.org/obo/CHEBI_6052 "isorhamnetin" http://purl.obolibrary.org/obo/CHEBI_6099 "kaempferide" http://purl.obolibrary.org/obo/CHEBI_30200 "kaempferol 3-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_75721 "kaempferol 3-O-glucuronide" http://purl.obolibrary.org/obo/CHEBI_31742 "kaempferol 3-O-beta-D-galactoside" http://purl.obolibrary.org/obo/CHEBI_31743 "kaempferol 3-O-beta-D-glucosyl-(1->2)-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_6357 "lactucaxanthin" http://purl.obolibrary.org/obo/CHEBI_28620 "licoisoflavone A" http://purl.obolibrary.org/obo/CHEBI_17219 "limonene-1,2-diol" http://purl.obolibrary.org/obo/CHEBI_16226 "limonin" http://purl.obolibrary.org/obo/CHEBI_28777 "liquiritigenin" http://purl.obolibrary.org/obo/CHEBI_27774 "lubimin" http://purl.obolibrary.org/obo/CHEBI_6553 "lucenin-2" http://purl.obolibrary.org/obo/CHEBI_6579 "luteolin 7-O-(6-O-malonyl-beta-D-glucoside)" http://purl.obolibrary.org/obo/CHEBI_27994 "luteolin 7-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_18128 "luteolin 7-O-beta-D-glucosiduronic acid" http://purl.obolibrary.org/obo/CHEBI_31788 "luteolin 7-O-neohesperidoside" http://purl.obolibrary.org/obo/CHEBI_77791 "luteolin 7-O-beta-D-glucoside(1-)" http://purl.obolibrary.org/obo/CHEBI_6602 "lycoxanthin" http://purl.obolibrary.org/obo/CHEBI_31799 "malvidin 3-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_38756 "methoxyisoflavone" http://purl.obolibrary.org/obo/CHEBI_66395 "miquelianin" http://purl.obolibrary.org/obo/CHEBI_25459 "myrtenic acid" http://purl.obolibrary.org/obo/CHEBI_10607 "n-propyl gallate" http://purl.obolibrary.org/obo/CHEBI_28327 "naringenin 7-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_7609 "nomilin" http://purl.obolibrary.org/obo/CHEBI_18257 "ornithine" http://purl.obolibrary.org/obo/CHEBI_69437 "orobol" http://purl.obolibrary.org/obo/CHEBI_7889 "paeoniflorin" http://purl.obolibrary.org/obo/CHEBI_36122 "pelargonidin 3-O-beta-D-glucoside chloride" http://purl.obolibrary.org/obo/CHEBI_28510 "pelargonidin chloride" http://purl.obolibrary.org/obo/CHEBI_75314 "peonidin" http://purl.obolibrary.org/obo/CHEBI_74793 "peonidin 3-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_15420 "perillyl alcohol" http://purl.obolibrary.org/obo/CHEBI_75318 "petunidin" http://purl.obolibrary.org/obo/CHEBI_31985 "petunidin 3-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_17556 "phaseollidin" http://purl.obolibrary.org/obo/CHEBI_33468 "phosphide(3-)" http://purl.obolibrary.org/obo/CHEBI_17327 "phytol" http://purl.obolibrary.org/obo/CHEBI_28821 "piperine" http://purl.obolibrary.org/obo/CHEBI_8454 "progoitrin(1-)" http://purl.obolibrary.org/obo/CHEBI_16997 "propane-1,2-diol" http://purl.obolibrary.org/obo/CHEBI_8600 "prunetin" http://purl.obolibrary.org/obo/CHEBI_8630 "pterostilbene" http://purl.obolibrary.org/obo/CHEBI_131498 "quercetin 3,4'-di-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_28299 "quercetin 3-O-beta-D-glucofuranoside" http://purl.obolibrary.org/obo/CHEBI_67486 "quercetin 3-O-beta-D-galactopyranoside" http://purl.obolibrary.org/obo/CHEBI_68352 "quercetin 3-O-beta-D-glucopyranoside" http://purl.obolibrary.org/obo/CHEBI_32082 "quercetin 3-O-beta-D-glucosyl-(1->2)-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_75839 "quercetin 4'-O-beta-D-glucopyranoside" http://purl.obolibrary.org/obo/CHEBI_131499 "quercetin 7,4'-di-O-beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_17558 "quercitrin" http://purl.obolibrary.org/obo/CHEBI_145013 "rebaudioside B(1-)" http://purl.obolibrary.org/obo/CHEBI_145022 "rebaudioside D" http://purl.obolibrary.org/obo/CHEBI_145018 "rebaudioside E" http://purl.obolibrary.org/obo/CHEBI_27881 "resveratrol" http://purl.obolibrary.org/obo/CHEBI_15035 "retinal" http://purl.obolibrary.org/obo/CHEBI_17226 "rosmarinic acid" http://purl.obolibrary.org/obo/CHEBI_8907 "rubixanthin" http://purl.obolibrary.org/obo/CHEBI_28527 "rutin" http://purl.obolibrary.org/obo/CHEBI_9055 "scoparone" http://purl.obolibrary.org/obo/CHEBI_26660 "sesterterpenoid" http://purl.obolibrary.org/obo/CHEBI_9162 "sinigrin(1-)" http://purl.obolibrary.org/obo/CHEBI_9188 "solanine" http://purl.obolibrary.org/obo/CHEBI_9209 "soyasapogenol B" http://purl.obolibrary.org/obo/CHEBI_9211 "soyasaponin I" http://purl.obolibrary.org/obo/CHEBI_145027 "steviol glycoside" http://purl.obolibrary.org/obo/CHEBI_9271 "stevioside" http://purl.obolibrary.org/obo/CHEBI_36027 "stilbenol" http://purl.obolibrary.org/obo/CHEBI_15138 "sulfide(2-)" http://purl.obolibrary.org/obo/CHEBI_9518 "theasinensin A" http://purl.obolibrary.org/obo/CHEBI_9522 "theogallin" http://purl.obolibrary.org/obo/CHEBI_28177 "theophylline" http://purl.obolibrary.org/obo/CHEBI_9630 "tomatine" http://purl.obolibrary.org/obo/CHEBI_32374 "trans-4-coumaric acid" http://purl.obolibrary.org/obo/CHEBI_16433 "trans-caffeic acid" http://purl.obolibrary.org/obo/CHEBI_145829 "trilobatin" http://purl.obolibrary.org/obo/CHEBI_27510 "umbelliferone" http://purl.obolibrary.org/obo/CHEBI_9908 "ursolic acid" http://purl.obolibrary.org/obo/CHEBI_65244 "zeinoxanthin" http://purl.obolibrary.org/obo/CHEBI_28425 "alpha-carotene" http://purl.obolibrary.org/obo/CHEBI_10223 "alpha-cryptoxanthin" http://purl.obolibrary.org/obo/CHEBI_27432 "alpha-linolenic acid" http://purl.obolibrary.org/obo/CHEBI_22470 "alpha-tocopherol" http://purl.obolibrary.org/obo/CHEBI_33270 "alpha-tocotrienol" http://purl.obolibrary.org/obo/CHEBI_30248 "beta-D-Gal-(1->3)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-D-Glc" http://purl.obolibrary.org/obo/CHEBI_22798 "beta-D-glucoside" http://purl.obolibrary.org/obo/CHEBI_17579 "beta-carotene" http://purl.obolibrary.org/obo/CHEBI_10362 "beta-cryptoxanthin" http://purl.obolibrary.org/obo/CHEBI_47771 "beta-tocopherol" http://purl.obolibrary.org/obo/CHEBI_33275 "beta-tocotrienol" http://purl.obolibrary.org/obo/CHEBI_28661 "gamma-linolenic acid" http://purl.obolibrary.org/obo/CHEBI_18185 "gamma-tocopherol" http://purl.obolibrary.org/obo/CHEBI_33277 "gamma-tocotrienol" http://purl.obolibrary.org/obo/CHEBI_47772 "delta-tocopherol" http://purl.obolibrary.org/obo/CHEBI_33276 "delta-tocotrienol" http://purl.obolibrary.org/obo/CHEBI_37168 "iota-carrageenan" http://purl.obolibrary.org/obo/CHEBI_10583 "kappa-carrageenan" http://purl.obolibrary.org/obo/CHEBI_37167 "lambda-carrageenan" http://purl.obolibrary.org/obo/CHEBI_25681 "omega-3 fatty acid" http://purl.obolibrary.org/obo/CHEBI_36009 "omega-6 fatty acid"

LilyAndres commented 1 year ago

Thanks @ddooley I am a bit confused because what we used in CHEBI is the "ChEBI Name" and not what I think is the "ChEBI ASCII Name":

Screen Shot 2022-11-23 at 2 04 28 pm

@kaiiam should this be fixed in the cdno-edit.owl as well? Instead of the Robot templates?

ddooley commented 1 year ago

It is puzzling. The chebi.owl and chebi-lite.owl files download directly from EMBL-EBI don't even have that α-carotene in it, they only have alpha-carotene.
Over on Ontobee there's no hint of α-carotene , see https://ontobee.org/ontology/CHEBI?iri=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FCHEBI_28425 . Its as though some inaccessable version of ChEBI is driving OLS. But you guys were able to download it with that α-carotene symbol!

LilyAndres commented 1 year ago

Well we have created the Robot template based in the Chebi name label... I'm getting nervous about this issue. We might need to check the name of the labels again...

kaiiam commented 1 year ago

Luckily this is an easy enough problem to solve. The long and short is that in our NCF robot template we're adding additional labels to terms e.g.:

image

where the second column is the extra label. We already import the correct lables during the chebi import process so for example chebi-import.owl looks like:

image

We then merge them together in cdno-edit, hence why we get:

image

We should be able to fix this by simply removing the unwanted extra lables e.g. α-carotene from the nutritional_components_framework.csv file.

LilyAndres commented 1 year ago

@ddooley Apologies for the late reply in this issue, I just made a new release with changes that will help to fix this issue, please let me know if the issue is still there. Thanks a lot.

LilyAndres commented 4 months ago

We had an error while trying to compile the Nutritional_Component_Framework.csv file and I tracked it this ticket from the release "v2022-12-23". I tested all the labels by adding them again and I found that the error comes from removing the labels (second column) from the following terms in the nutritional_component_framework.csv:

CHEBI:23042 "carotene" CHEBI:28794 "coumarin" CHEBI:28802 "flavonols" CHEBI:24698 "hydroxyflavone"

If we delete these labels we get the following error:

Screen Shot 2024-05-23 at 10 08 17 pm

Any changes I try to make in the Nutritional_Component_Framework.csv file are not working until I put these labels back, run the pipeline with whatever changes I am making in this template and then it runs and makes the changes. Then I remove the labels and run the pipeline again, the changes are kept and the labels removed to avoid duplication. This is not the best thing to do but Need to work with one issue at the time.

LilyAndres commented 3 months ago

Change to "AT rdfs:label" from the label column in the nutritional_component_framework.csv file. For testing