skwsm / gmo

the Growth Medium Ontology
1 stars 0 forks source link

同じシノニムがあるGMO #7

Open okbp opened 2 years ago

okbp commented 2 years ago

同名のラベルを持つGMOが複数あるケース。ラベルは[ rdfs:label | skos:prefLabel | skos:altLabel | gmo:GMO_000110 ] 自動アサインが出来ないので可能であれば修正して頂きたい。

合計14件

"" //空文字 GMO_001055 GMO_002204 GMO_002219

"C16H34" GMO_002033 GMO_002098

"(C5H8O4)n" GMO_002008 GMO_002009

"C5H12O5" GMO_002090 GMO_002092

"Malic acid" GMO_001992 GMO_001993

"Azlocilline" GMO_002240 GMO_002241

"Azlocillinum" GMO_002240 GMO_002241

"C56H98N16O13" GMO_002197 GMO_002323

"PABA" GMO_001026 GMO_001616

"4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid" GMO_001142 GMO_001738

"Ferrous ammonium sulfate hexahydrate" GMO_001076 GMO_002045

"Fructose (20% solution)" GMO_001123 GMO_001464

"C5H10O5" GMO_002000 GMO_002002

"C18H32O16" GMO_002012 GMO_002018

okbp commented 2 years ago

ラベル重複を検出するSPARQL グラフ名は環境に応じて書き換える

prefix gmo: <http://purl.jp/bio/10/gmo/>
prefix skos: <http://www.w3.org/2004/02/skos/core#>

SELECT  ?label GROUP_CONCAT(DISTINCT ?gmo_id ; separator=", ")  AS ?gmo_ids (COUNT(DISTINCT ?component) AS ?count)
FROM <http://localhost:8893/gmo>
{
  VALUES ?label_type {rdfs:label skos:prefLabel skos:altLabel gmo:GMO_000110  }
  ?component rdfs:subClassOf ?sub_class ;
    ?label_type ?label .
  ?sub_class rdfs:label ?component_type .
  FILTER (lang(?label) = 'en')
  FILTER (lang(?component_type) = 'en')
  FILTER (?sub_class != gmo:Obsolete_classes )
  ?component rdfs:label ?rdfs_label .
  FILTER (lang(?rdfs_label) = 'en')
  BIND (replace(STR(?component), "http://purl.jp/bio/10/gmo/", "") AS ?gmo_id)
} GROUP BY ?label HAVING(COUNT(DISTINCT ?component) > 1) ORDER BY DESC 3
okbp commented 2 years ago

'Na2B4O7"と"amphotericin B"は #3 で対応されていた為リストから削除