whitmanarchive / whitman-LG_1855_variorum

Data Repo for in progress variorum

Renumber lines in TEI #74

Closed (nichgray closed 4 years ago)

nichgray commented 5 years ago

Right now there is a discrepancy between the line numbers shown in the interface and the line numbers encoded in the @xml:id values in the TEI. Consider updating the TEI values in some automated fashion.

(Remember that manuscripts and relations document link to existing IDs; these would need to be updated as well if this change is made.)
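As a rough illustration of what "automated" could mean here, the sketch below builds an old-to-new ID map from the `xml:id` values in document order and then applies it to both the IDs and any `#id` pointers. The ID shape (`prefix_l` plus a number, e.g. `p_l7`), the single running count, and the helper names `build_id_map` and `apply_id_map` are all assumptions made for the example; the real IDs and numbering rules in this repo may differ.

```ruby
# ASSUMPTION: line ids look like "<prefix>_l<number>", e.g. "p_l7".
# The actual id format in the variorum TEI may be different.
LINE_ID = /xml:id="([^"]*?_l)(\d+)"/

# Walk the ids in document order and assign consecutive numbers
# (one running count; per-poem restarts are not modeled here).
def build_id_map(tei_text)
  map = {}
  counter = 0
  tei_text.scan(LINE_ID) do |prefix, number|
    counter += 1
    map["#{prefix}#{number}"] = "#{prefix}#{counter}"
  end
  map
end

# Rewrite ids and "#id" pointers in a single pass over the raw text.
# Anything not found in the map is returned unchanged.
def apply_id_map(text, map)
  text.gsub(/(xml:id="|#)([^"#\s]+)/) do
    lead = $1
    id = $2
    "#{lead}#{map.fetch(id, id)}"
  end
end
```

Because the map is built once from the main TEI file, the same map could then be applied to the manuscript files and the relations document so that their links stay in sync.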

nichgray commented 4 years ago

Okay, here is what I think needs to be done:

nichgray commented 4 years ago

I should clarify, per the second checklist item above, that ONLY the part between the "l" and the second "#" is affected (so, for instance, nothing after "#ab" should change in the above example).
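To make that constraint concrete, here is a minimal sketch that rewrites only the number between the "l" and the second "#" and passes everything else through untouched. The pointer shape used here (`#p_l7#ab2`) is invented for illustration, as are the names `POINTER` and `renumber_pointer`; the real values may look different.

```ruby
# ASSUMPTION: a pointer looks like "#<prefix>l<number>#<suffix>",
# e.g. "#p_l7#ab2". This shape is hypothetical.
POINTER = /\A(#[^#]*?l)(\d+)(#.*)\z/

# Replace only the number between the "l" and the second "#".
# number_map maps old line numbers (Integer) to new ones.
def renumber_pointer(value, number_map)
  m = POINTER.match(value)
  return value unless m # leave anything that does not fit the shape alone
  new_number = number_map.fetch(m[2].to_i, m[2])
  "#{m[1]}#{new_number}#{m[3]}"
end
```

Capturing the suffix as its own group means that even a suffix that happens to contain "l" followed by digits is copied through unchanged.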

jduss4 commented 4 years ago

Third item -- match the corresp value, not the element id

jduss4 commented 4 years ago

I have completed tasks 1 and 2. I also wrote a script that CAN remove all those corresps, but it was deemed too destructive: Nokogiri also reformats things that WWA relies on, such as the return characters between <add> and <del> type elements, which need to appear as spaces. @nichgray will be working on an XSLT script to scrub the corresps instead.

https://github.com/whitmanarchive/whitman-LG_1855_variorum/blob/master/scripts/one-offs/line_changer/remover.rb
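For comparison, one way to sidestep the reformatting side effect without parsing the XML at all is a text-level pass. The sketch below is not the linked `remover.rb` and not the planned XSLT; it is a hypothetical alternative, and the helper name `strip_corresps` is made up. It assumes double-quoted attributes and matches on the corresp value, not the element id.

```ruby
require "set"

# Remove corresp="..." attributes whose value is in `targets`.
# Works on the raw text, so nothing outside the removed attribute
# (and the whitespace directly before it) is changed, including
# line breaks between <add> and <del> elements.
# ASSUMPTION: attributes are double-quoted and the pattern does not
# occur in text content.
def strip_corresps(tei_text, targets)
  tei_text.gsub(/\s+corresp="([^"]*)"/) do |attr|
    targets.include?($1) ? "" : attr
  end
end
```

A call such as `strip_corresps(File.read(path), Set["#a1"])` would return the edited text, which could be diffed against the original before anything is written back.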