DOREMUS-ANR / knowledge-base

Repository containing controlled vocabularies and data published by DOREMUS
http://data.doremus.org/
Apache License 2.0
16 stars 7 forks source link

[MoP] Mapping "unspecified" labels #3

Closed pasqLisena closed 7 years ago

pasqLisena commented 8 years ago

Simpler for me explaining with an example.

In MARC file, you can find as medium of performance orchestra. In the vocabulary, there is orchestra - unspecified, that conceptually should be the same.

The String2URI fails in this case.

Probably this problem will disappear when we will complete the alignment for the MoP (so more labels will be available). But we should keep in mind this possible issue.

rtroncy commented 8 years ago

For the medium of performances, the source data in MARC makes use of labels or of codes? Does this vary depending if it comes from Philharmonie or BnF? In your example, this was Intermarc or Unimarc? I agree, when the vocabulary will be aligned and consolidated, the problem should be resolved but this is useful to keep it in mind.

pasqLisena commented 8 years ago

For the medium of performances, the source data in MARC makes use of labels or of codes? Does this vary depending if it comes from Philharmonie or BnF? In your example, this was Intermarc or Unimarc?

Both PP and BnF, both Intermarc and Unimarc, make use of labels in French (actually the problem was between orchestre and orchestre - non spécifié).

pasqLisena commented 8 years ago

As proposed by @marie-ototoi, we can remove the - unspecified part, because we defined broader/narrower relationship (see #2 ).