Digital-Humanities-Quarterly / dhq-journal

DHQ is an open-access, peer-reviewed journal of digital humanities.
http://www.digitalhumanities.org/dhq/
10 stars 5 forks source link

Add keywords #45

Closed amclark42 closed 11 months ago

amclark42 commented 12 months ago

<keywords> added programmatically to DHQ articles, using the output of @blaak-18's Python script as generated by @brgrey!

Some articles already had DHQ keywords attached; the original lists have been retained below the newer ones. The articles containing CDATA have been left alone, since XQuery or XSLT would remove those declarations during parsing. A report is available in the repository wiki.

amclark42 commented 12 months ago

I recommend doing the remaining manual touchups in the add-keywords branch, so the links in the report stay useful. I suggest that this happen before merging the PR.

An alternate workflow would be to merge the PR now but keep the branch open for further edits.

brgrey commented 12 months ago

I'll complete the manual touchups in this branch. As of a few minutes ago, I also made adjustments to DHQauthor-TEI.rng to allow for it to validate correctly spelled and assigned keywords in @corresp of <term>

brgrey commented 12 months ago

All problematic articles (with CDATA sections) have been manually keyworded. Articles with previous @scheme="dhq_keywords" attributes of <keywords> have been combined with independently assigned keywords. Most were authorial keywords placed in the incorrect spot. Article 000695's issues were not resolved, but the article appears to be in early stages of encoding.

The xslt transform for after TEI-Garage and author template have been updated to reflect the new format of <term> for DHQ-keywords

brgrey commented 12 months ago

Spelling errors and extraneous spaces removed from keyword values per articles flagged in the erroneous values report

brgrey commented 11 months ago

@juliaflanders Since regular encoding work has continued over the past week or so, a few merge conflicts have arisen. We'll have to manually resolve these before merging add-keywords into main, but this should be a fairly simple process. Worst case scenario if any files have changed drastically, I can always just keep main's version of changes and manually enter keyword terms.