Pittsburgh-NEH-Institute / pr-app

eXist-db app development
MIT License
3 stars 1 forks source link

[IMPEDED] Duplicate @xml:id values #39

Closed djbpitt closed 2 years ago

djbpitt commented 2 years ago

The following three XML files:

have the same @xml:id value: GH-19CUK-18380120.

There are also two files that have the shared @xml:id value GH-BLNPS-18040106:

Because we decided to use the @xml:id as a unique identifier in the corpus, the values all need to be different. We should:

  1. Change any duplicate values to something unique and document our system for creating @xml:id values for documents.
  2. Create a Schematron rule to validate each the @xml:id value of each document against the entire collection.

It is not possible to create working links between advanced search results and a reading view unless all files have unique @xml:id values. The Schematron fix is less urgent.

djbpitt commented 2 years ago

Fixed

djbpitt commented 2 years ago

Issue was closed prematurely; new @xml:id values have not yet been pushed.

2022-06-04: djb Created schematron to validate unique @xml:id, associated with all XML source files.

djbpitt commented 2 years ago

Fixed for real this time