PerseusDL / canonical

This will be the base repo for all text and annotation data published in the PDL
16 stars 17 forks source link

betacode to unicode conversion: notes #81

Open lcerrato opened 9 years ago

lcerrato commented 9 years ago

beta code in notes was not converted to Unicode (noticed in tlg0023.tlg001.perseus-grc1.xml probably elsewhere as well)

srdee commented 9 years ago

When this is done, betacode should first be standardized as per: https://github.com/PerseusDL/tei-conversion-tools/wiki/Greek-Betacode-to-Unicode-Transformations

srdee commented 9 years ago

https://github.com/PerseusDL/tei-conversion-tools/blob/master/jar/tei.transformer.lang_grc.jar works to convert betacode in notes to UTF-8, but produces the errors outlined in https://github.com/PerseusDL/tei-conversion-tools/wiki/Greek-Betacode-to-Unicode-Transformations