Polished Collation Notes and Checklist

Checklist

For each section, are reserved fragment files incorporated into collation output files? Have collation <app> elements been proofed and corrected for consistent semantic comparison of witnesses?

[x] Part 1 (C01 - C06 in pieces)
[x] Part 2 (C07 in pieces)
[x] Part 3 (C08 in pieces)
[x] Part 3.2 (C09 to C10)
- [x] C09
- [x] C10
[x] Part 3.5 (C11 to C14a)
- [x] C11 (in pieces)
- [x] C12 (in pieces)
- [x] C13 (in pieces)
- [x] C14a
[x] Part 4 (C14, b-e pieces)
[x] Part 5 (C15 and C16 pieces)
[ ] Part 6
[ ] Part 7
[ ] Part 8
[ ] Part 9
[ ] Part 10
[ ] Part 11
[ ] Part 12

Notes

Deleted passages in Thomas: `<del>`

I am hand-correcting the collation output so that each fully deleted passage in Thomas sits in a single <app>. One act of deletion is one semantic alteration to the text, and that is not properly recorded when the collation splits it across multiple <app> elements. In the ordinary output, when a deletion is longer than a word or two, we get the deletion start-marker in one <app>, followed by one or more <app> elements where all witnesses appear to be in unison, before the witnesses diverge again in a last <app> holding the deletion end-marker. That obscures the nature of the change, so I'm unifying the full deletion and its comparison in one <app>.
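The merge described above is done by hand, but its logic can be sketched in Python. This is only an illustration under stated assumptions: each <app> is modelled as a plain dict from witness name to reading text, and the marker strings `<del>` and `</del>`, the witness names, and the function names are all hypothetical stand-ins, not the project's actual markup or tooling.

```python
from typing import Dict, List

# Hypothetical marker strings; the real deletion markers in the
# collation output may look different.
DEL_START = "<del>"
DEL_END = "</del>"

App = Dict[str, str]  # witness name -> reading text for one <app>


def _join(run: List[App]) -> App:
    """Concatenate the readings of several apps, witness by witness."""
    witnesses: List[str] = []
    for app in run:
        for wit in app:
            if wit not in witnesses:
                witnesses.append(wit)
    return {w: " ".join(app[w] for app in run if app.get(w)) for w in witnesses}


def merge_deletion_apps(apps: List[App], witness: str = "Thomas") -> List[App]:
    """Merge each run of apps that spans one deletion in `witness`."""
    merged: List[App] = []
    run: List[App] = []  # apps collected since an unclosed deletion start
    for app in apps:
        reading = app.get(witness, "")
        if run:
            run.append(app)
            if DEL_END in reading:
                merged.append(_join(run))
                run = []
        elif DEL_START in reading and DEL_END not in reading:
            run = [app]  # deletion opens here and closes in a later app
        else:
            merged.append(app)
    merged.extend(run)  # unclosed deletion: leave those apps untouched
    return merged
```

Given a start-marker app, an "in unison" app, and an end-marker app in sequence, the function returns one combined app whose Thomas reading holds the whole deletion. Anything it cannot close is passed through unchanged, so results would still need proofing by eye.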

Status of `<add>` elements

The collation process has put <add> elements in the "ignore" list, so that their contents are consumed and output, but the <add> itself does not appear in any form in the CollateX output. <add> elements were ignored to simplify the processing of the msColl. However(!), in ignoring the <add> elements, the output collation is now missing information about a) which portions of the MS were inserted in Percy's hand, and b) the location of hand additions in the Thomas copy. I have included the Thomas <add> elements wherever I patch in collation fragments by hand, that is, in passages that did not align evenly (=the long passages where an insertion was indicated). In short, <add> tags are not preserved, except around the lengthier segments that I had to maneuver by hand into the collation. The original <add> information is preserved, of course, in the S-GA files.
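To make concrete what "ignoring" an element costs, here is a minimal Python sketch. It is not the project's collation code: the `IGNORED` tuple, the function name, and the `hand` attribute value in the usage example are assumptions made up for illustration.

```python
import re

# Hypothetical ignore list; the project's real configuration may differ.
IGNORED = ("add",)


def strip_ignored_tags(xml_text: str, ignored=IGNORED) -> str:
    """Drop the start and end tags of ignored elements, keeping their contents."""
    names = "|".join(re.escape(name) for name in ignored)
    # \b keeps "add" from matching longer element names such as "address".
    pattern = re.compile(rf"</?(?:{names})\b[^>]*>")
    return pattern.sub("", xml_text)
```

For example, `strip_ignored_tags('I saw <add hand="#x">the pale</add> student')` returns `'I saw the pale student'`: the words survive, but nothing downstream can tell that they were an addition or whose hand wrote them.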

Options re `<add>` info from S-GA and Thomas:

Perhaps we should eventually patch the <add> information, where interesting (e.g. Percy's hand), into the output Variorum. Following hand-correction of <app> elements, it is no longer feasible to re-run the collation, so changing the collation algorithm at this late a stage would be counter-productive (=more loss than gain, given the Variorum priority of our edition: to align semantically meaningful units of comparison first and foremost).

White-space issues

I've been correcting white-space issues in the output collation: where words sit at line-endings in Thomas_fullFlat (and Thomas_full), for example, the collation smushes two words together as an alternate reading. Not good, but infrequent so far.
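A quick way to look for such cases could be sketched as follows. This is a hedged illustration, not part of the existing workflow: it assumes the readings of one <app> are available as a dict from witness name to text, and the function name and witness labels are hypothetical. It flags any token in one witness that equals two adjacent words of another witness run together.

```python
from typing import Dict, List, Tuple

Suspect = Tuple[str, str, str, Tuple[str, str]]


def find_smushed(readings: Dict[str, str]) -> List[Suspect]:
    """Return (witness, token, other_witness, (word_a, word_b)) for suspect tokens."""
    suspects: List[Suspect] = []
    for wit, text in readings.items():
        for token in text.split():
            for other, other_text in readings.items():
                if other == wit:
                    continue
                words = other_text.split()
                # Compare the token against every adjacent word pair.
                for a, b in zip(words, words[1:]):
                    if a + b == token:
                        suspects.append((wit, token, other, (a, b)))
    return suspects
```

A check like this only produces candidates for review; genuine compounds that one edition prints as two words would be flagged too, so each hit still needs an editorial decision.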

Pointers to all editions?

If we are using pointers to pull in specific locations of each source edition, we can/should eventually point into Thomas_full and the "full" versions of each source edition, so that the information held there (such as what we preserved in <comment> elements) becomes part of the Variorum.
Perhaps we can also work with the pointers to import info from the <add> elements discussed above.
