FrankensteinVariorum / fv-collation

First-stage collation processing in the Frankenstein Variorum Project. For post-processing and Variorum development, see our GitHub organization: https://github.com/FrankensteinVariorum
https://frankensteinvariorum.github.io/fv-collation/
GNU Affero General Public License v3.0

Saxon alternative via Python? #81

Open ebeshero opened 2 years ago

ebeshero commented 2 years ago

Try a Python wrapper around Saxon-C to parse the XML when preparing the collation. This may be better than Pulldom, which handles XML tags in an idiosyncratic way that is very difficult to read. Thanks to @djbpitt for the idea.
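For readers unfamiliar with the event-by-event style being described, here is a minimal sketch of Pulldom-based flattening using Python's standard `xml.dom.pulldom` module. It is not the project's actual code: the function name `flatten_for_collation` and the sample markup are hypothetical, and attributes are deliberately dropped to keep the example short.

```python
from xml.dom import pulldom


def flatten_for_collation(xml_text):
    """Turn XML into a flat string, keeping tags as literal text.

    Hypothetical helper for illustration only; not fv-collation code.
    """
    pieces = []
    for event, node in pulldom.parseString(xml_text):
        if event == pulldom.START_ELEMENT:
            # Re-emit the opening tag by hand (attributes are ignored here).
            pieces.append(f"<{node.nodeName}>")
        elif event == pulldom.END_ELEMENT:
            # Closing tags also have to be rebuilt manually.
            pieces.append(f"</{node.nodeName}>")
        elif event == pulldom.CHARACTERS:
            # Text may arrive in several chunks; joining restores it.
            pieces.append(node.data)
    return "".join(pieces)


# Made-up sample input, not taken from the Variorum source files.
sample = "<p>It was on a <del>dark</del> dreary night</p>"
result = flatten_for_collation(sample)
print(result)
```

Every tag has to be handled through an explicit branch on the event type, which is the readability cost raised here. An XPath- or XSLT-based approach would select and serialize nodes declaratively instead.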

ond1 commented 11 months ago

SaxonC 12.X now releases Python wheels (saxonche for the open-source edition, and saxoncpe and saxoncee for the commercial products).

ebeshero commented 11 months ago

@ond1 We left this issue open from last year, but we have since decided to stick with Pulldom for preparing the collation strings. It took us a while to be sure it was the best method, but it worked out!

ebeshero commented 11 months ago

@ond1 Thanks for this update, though! We will apply it in other projects!