srophe / caesarea-data

Data repository for Caesarea-Maritima.org
0 stars 2 forks source link

Remove empty notes, context descs, and bibls #112

Closed wlpotter closed 2 years ago

wlpotter commented 2 years ago

Also remove empty ('extra') tei:citedRange elements in bibls that are used (see the additional bibls in https://caesarea-maritima.org/testimonia/133 for an example)

wlpotter commented 2 years ago

I can do these all, I believe, with find-and-replace-in-files using regex or even just string searching. Replace all of these with an empty string, except for the 'remove extra whitespace'.

wlpotter commented 2 years ago

Write these as a script instead for running these as part of a post-processing script

wlpotter commented 2 years ago

I believe my updates here have added this to the post-processing script, but I need to do some testing and make sure nothing broke. I'm also not sure if that script will break on data that's already been processed? All this to say that I'm gonna do the find-and-replace searches above on current data.

wlpotter commented 2 years ago

These should all be fixed. Leaving this open to spot-check; and to remind me to open an issue about testing and revising the the post-processing script to be more user-friendly.