RTXteam / RTX-KG2

Build system for the RTX-KG2 biomedical knowledge graph, part of the ARAX reasoning system (https://github.com/RTXTeam/RTX)
MIT License

Address `biolink:InformationContentEntity` in KG2 #326

Open ecwood opened 11 months ago

ecwood commented 11 months ago

Spun off of #286 and https://github.com/NCATSTranslator/reasoner-validator/issues/88#issuecomment-1633139389

ecwood commented 11 months ago

While looking at "Walloon Language" (picked from the table because, with so many languages in it, I thought this might be an easy tree to find a better source for), I found this very odd node:

{
  "iri": "https://identifiers.org/ncit:C51277",
  "category_label": "disease_or_phenotypic_feature",
  "deprecated": "True",
  "name": "Spoken Language",
  "description": "Language communicated by means of words or sounds uttered through the mouth.; UMLS Semantic Type: STY:T033",
  "provided_by": "['infores:ncit']",
  "id": "NCIT:C51277",
  "category": "biolink:DiseaseOrPhenotypicFeature",
  "update_date": "2021"
}

I am not sure why this node (an ancestor of "Walloon Language") would be categorized as a disease_or_phenotypic_feature.

Apparently, though, T033 is categorized as a disease or phenotypic feature: https://github.com/biolink/biolink-model/blob/f2519ec416ce62efb8276cee48f28e95256e8e7e/biolink-model.yaml#L7565-L7579
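
To get a sense of how many other nodes are affected, something like the following could be used to flag nodes that carry the T033 semantic type and ended up as `biolink:DiseaseOrPhenotypicFeature`. This is only a rough sketch: it assumes nodes are available as Python dicts shaped like the one above and that the semantic type always appears in the description as `UMLS Semantic Type: STY:Txxx`. The helper names (`semantic_types`, `flag_nodes`) and the second sample node are made up for illustration and are not part of the KG2 build code.

```python
import re

# Assumption: the semantic type is embedded in the description text,
# as in the node above ("...; UMLS Semantic Type: STY:T033").
STY_PATTERN = re.compile(r"UMLS Semantic Type: STY:(T\d{3})")


def semantic_types(node):
    """Return the UMLS semantic type codes found in a node's description."""
    return STY_PATTERN.findall(node.get("description") or "")


def flag_nodes(nodes, sty="T033",
               category="biolink:DiseaseOrPhenotypicFeature"):
    """Return the ids of nodes that have both the given semantic type and category."""
    return [
        node["id"]
        for node in nodes
        if sty in semantic_types(node) and node.get("category") == category
    ]


nodes = [
    # The node quoted above (trimmed to the fields used here).
    {
        "id": "NCIT:C51277",
        "name": "Spoken Language",
        "description": "Language communicated by means of words or sounds "
                       "uttered through the mouth.; UMLS Semantic Type: STY:T033",
        "category": "biolink:DiseaseOrPhenotypicFeature",
    },
    # Hypothetical node, included only to show a non-match.
    {
        "id": "EXAMPLE:0001",
        "name": "Hypothetical finding",
        "description": "Made-up entry.; UMLS Semantic Type: STY:T033",
        "category": "biolink:NamedThing",
    },
]

print(flag_nodes(nodes))
```

Only the first node is flagged, since the second one has T033 but a different category. Filtering on both fields keeps the list limited to nodes whose category could have come from the T033 mapping.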