Closed haney closed 5 years ago
The issue is fixed in v0.1.11, should be on PyPI already. Thanks @haney for bug report and especially for providing a fix!
I have not built Windows binary for ged2doc
with this fix, I presume you do not need that but let me know if you want it rebuilt.
Description
Combining characters in ANSEL documents do not appear to be handled appropriately. In the ANSEL encoding, combining characters occur before the character they modify, however in Unicode, they occur after. This translation does not appear to be happening when reading ANSEL GEDCOM documents.
What I Did
Given the document, I would have expected the output:
Instead I'm seeing
This implies that the position of the combining character is unchanged when it was translated to unicode, however given the rules for combining characters in unicode, it is getting applied to the first character instead of the second.