Closed sam-s closed 5 years ago
Unicode spaces are transliterated correctly. I'm guessing you mean the EN QUAD
character. This is code point U+2000 (in hex). You forgot 0x
in front of your 2000
.
>>> from unidecode import unidecode
>>> unidecode("a"+chr(0x2000)+"b")
'a b'
Current behavior:
Desired behavior:
Rationale
Unicode spaces naturally correspond to the usual ascii spaces.