ffdev-info / wikidp-issues

An issues repository for resolving issues in Wikidata around the records relating to Digital Preservation
GNU General Public License v3.0
1 stars 0 forks source link

Cannot easily tease apart signatures from different sources to find their discrete boundaries #22

Open ross-spencer opened 3 years ago

ross-spencer commented 3 years ago

Description of problem

As Wikidata's structure is fairly flat we need to either trust a record's identification patterns either belong together, or that they are separated somehow (usually the next BOF in the sequence). Presently Siegfried contains some heuristics (to be described) but having Wikidata describe this vs. Roy work it out would be ideal.

To get an idea about what Roy can currently process, run ./roy build -wikidatadebug.

Permalink

ross-spencer commented 3 years ago

Related to: https://github.com/ross-spencer/WikiDP-Issues/issues/15