INCATools / kgcl

Datamodel for KGCL (Knowledge Graph Change Language)
https://w3id.org/kgcl/
MIT License
11 stars 4 forks source link

Consider a replace keyword command #72

Open cmungall opened 5 months ago

cmungall commented 5 months ago

This may be out of scope but wanted to add for discussion.

Frequently we want to replace instances of a token, word, or phrase in lexical ontology elements such as labels, definitions.

E.g

https://wiki.geneontology.org/Ontology_meeting_2024-04-08#Bulk_update_specific_keywords

This may be out of scope as it drifts into NLP territory. How are the boundaries of words or tokens defined? Do we have different tokenizers for chemicals vs biological language? What about bespoke rules that exclude certain tokens from replacement in some contexts?

This may be better handled by a separate tool that generates KGCL rename and redefine commands given an ontology plus some replacement rules.

In fact the logic is very similar to that uses for the synonymizer