gbv / cocoda

A web-based tool for creating mappings between knowledge organization systems.
https://coli-conc.gbv.de/cocoda/
MIT License
39 stars 5 forks source link

Automatically detect and link notations mentioned in concept notes #76

Open stefandesu opened 6 years ago

stefandesu commented 6 years ago

In some notes, especially in the editorial notes of DDC, there are references to other notations. We could use a regular expression in combination with the URI namespace of the respective scheme to find notations, determine their URI, and turn them into a link. (See also #70.)

stefandesu commented 5 years ago

Even when using notation regexes, this won't be easy to solve because there will be a lot of false positives for most concept schemes. Therefore, I removed the milestone and added a "question` label to further plan/discuss this.

nichtich commented 5 years ago

The rate of false positives depends on the particular concept scheme notations. We may enable it only for selected concept schemes and add further limits (e.g. a minimum notation length) to minimize false positives. Moreover false positives are less problematic with #278.

stefandesu commented 2 years ago

As far as I can see, the source data in MARC XML usually contains links to other concept in notes in a structured way. How about adding something similar in JSKOS format?