Closed amclark42 closed 11 months ago
I recommend doing the remaining manual touchups in the add-keywords branch, so the links in the report stay useful. I suggest that this happen before merging the PR.
An alternate workflow would be to merge the PR now but keep the branch open for further edits.
I'll complete the manual touchups in this branch. As of a few minutes ago, I have also adjusted DHQauthor-TEI.rng so that it validates correctly spelled and assigned keywords in @corresp of <term>.
All problematic articles (with CDATA sections) have been manually keyworded. Where <keywords> already carried @scheme="dhq_keywords", the existing keywords have been combined with the independently assigned ones. Most were authorial keywords placed in the incorrect spot. Article 000695's issues were not resolved, but the article appears to be in the early stages of encoding.
The XSLT transform that runs after TEI-Garage and the author template have both been updated to reflect the new format of <term> for DHQ-keywords.
Spelling errors and extraneous spaces have been removed from keyword values in the articles flagged in the erroneous values report.
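For anyone who wants to re-check an article after the cleanup, the following is a minimal sketch of that kind of check, not the script or schema rule actually used in this branch. It assumes that keyword pointers live in @corresp of <term>, that multiple pointers are space-separated, and that values look like `#keyword`; the `KNOWN_KEYWORDS` set and the `clean_terms` function are hypothetical names made up for illustration.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

# Hypothetical controlled vocabulary; the real list lives elsewhere.
KNOWN_KEYWORDS = {"#archives", "#encoding"}


def clean_terms(xml_text, known=KNOWN_KEYWORDS):
    """Trim stray whitespace in term/@corresp and collect unknown values."""
    root = ET.fromstring(xml_text)
    unknown = []
    for term in root.iter(f"{{{TEI_NS}}}term"):
        value = term.get("corresp")
        if value is None:
            continue
        # split() with no arguments drops leading, trailing, and repeated spaces.
        tokens = value.split()
        term.set("corresp", " ".join(tokens))
        # Anything outside the vocabulary is likely a misspelling.
        unknown.extend(t for t in tokens if t not in known)
    return root, unknown


# Made-up sample: one value with extra spaces, one misspelled value.
sample = (
    f'<keywords xmlns="{TEI_NS}">'
    '<term corresp=" #archives "/>'
    '<term corresp="#encodng"/>'
    "</keywords>"
)
cleaned_root, unknown_values = clean_terms(sample)
```

Here `unknown_values` holds the misspelled pointer, and the first <term> ends up with its surrounding spaces removed.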
@juliaflanders Since regular encoding work has continued over the past week or so, a few merge conflicts have arisen. We'll have to manually resolve these before merging add-keywords into main, but this should be a fairly simple process. Worst case scenario, if any files have changed drastically, I can always just keep main's version of changes and manually enter keyword terms.
<keywords> added programmatically to DHQ articles, using the output of @blaak-18's Python script as generated by @brgrey! Some articles already had DHQ keywords attached; the original lists have been retained below the newer ones. The articles containing CDATA have been left alone, since XQuery or XSLT would remove the CDATA section markers during parsing. A report is available in the repository wiki.
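The CDATA concern can be illustrated with a small round trip. This sketch uses Python's standard-library ElementTree rather than XQuery or XSLT, so it is an analogy only: the data model those languages work on likewise has no CDATA node, so the section boundaries are not available to be written back out. The fragment is invented and not taken from a DHQ article.

```python
import xml.etree.ElementTree as ET

# Invented fragment with a CDATA section protecting a literal "<".
source = "<p><![CDATA[if (a < b) { run(); }]]></p>"

# Parsing keeps the text content but not the CDATA section markers.
root = ET.fromstring(source)

# Serializing again escapes the "<" instead of restoring the CDATA wrapper.
roundtrip = ET.tostring(root, encoding="unicode")
```

After the round trip the text is unchanged, but `roundtrip` contains an escaped `&lt;` and no `<![CDATA[` wrapper, which is why a programmatic pass would have altered those articles.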