Open gregorycrane opened 1 year ago
Thanks @gregorycrane, I'll take a look before our next call.
Building a list of things here that I can demonstrate with a PR back to https://github.com/gregorycrane/BornDigitalAlignedXlations:
- ʼ (0x2BC), but the tokens here use ' (0x27) vs ’ (0x2019)
- &qoute; tags need to be excluded

I'm working around them in a processing script that I can also link back.
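For reference, a minimal sketch of what that kind of workaround could look like. This is not the actual processing script. It assumes all apostrophe variants should collapse to ' (0x27), which may not be the right target for this data, and `normalize_token` is a hypothetical name:

```python
# Assumption: 0x27 is the canonical apostrophe; swap the mapping values
# if the tokens should instead be normalized to 0x2BC or 0x2019.
APOSTROPHE_VARIANTS = {
    "\u02bc": "'",  # MODIFIER LETTER APOSTROPHE (0x2BC)
    "\u2019": "'",  # RIGHT SINGLE QUOTATION MARK (0x2019)
}
_APOSTROPHE_TABLE = str.maketrans(APOSTROPHE_VARIANTS)

# The literal marker mentioned above, copied as it appears in this thread.
STRAY_MARKER = "&qoute;"


def normalize_token(token: str) -> str:
    """Drop the stray marker and map apostrophe variants to one code point."""
    return token.replace(STRAY_MARKER, "").translate(_APOSTROPHE_TABLE)
```

Running both sides (text and tokens) through the same function before comparing them should avoid mismatches caused only by the apostrophe code point.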
I have created some files as a starting point for alignment. I have added an extra file for "glosses", where we use the alignment data to generate contextual glosses: https://github.com/gregorycrane/BornDigitalAlignedXlations/blob/main/README.md
I have not put the alignments into JSON but they are many-to-many lists. I can take the next step and do the JSON work if we want to use the DUCAT format as a standard (if that is what we are basing our JSON work on). I just wanted to get something out sooner rather than later!
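As a rough illustration of the JSON step, here is a sketch that serializes many-to-many alignment lists. It does not follow the DUCAT schema (that would need to be checked against the DUCAT documentation). The `source` and `target` field names and the `alignments_to_json` function are hypothetical:

```python
import json


def alignments_to_json(pairs):
    """Serialize many-to-many alignments as a JSON array.

    `pairs` is an iterable of (source_tokens, target_tokens), where each
    side is a list of one or more tokens. Field names are placeholders,
    not the DUCAT format.
    """
    records = [
        {"source": list(src), "target": list(tgt)}
        for src, tgt in pairs
    ]
    # ensure_ascii=False keeps non-ASCII text readable in the output.
    return json.dumps(records, ensure_ascii=False, indent=2)
```

Each record holds one alignment group, so a single source token aligned to several target tokens (or the reverse) stays together in one object.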