Closed geoHeil closed 1 year ago
if you import from jellyfish._jellyfish you'll get the Python version that handles unicode properly
still unsure what to do about C versions
@jamesturk is this still an issue? as far as I can see this is fixed:
>>> jellyfish._jellyfish.match_rating_codex('ä')
'Ä'
>>> jellyfish.cjellyfish.match_rating_codex('ä')
'Ä'
added a test to confirm this is fixed & avoid regression
jellyfish.match_rating_codex('ä')
fails withValueError: character U+ffffffff is not in range [U+0000; U+10ffff]
how should umlauts be handled to be fit for jellyfish?