avian2 / unidecode

ASCII transliterations of Unicode text - GitHub mirror
https://pypi.python.org/pypi/Unidecode
GNU General Public License v2.0

Conversion can be inaccurate in the context of scientific units. #65

Closed mmcdermott closed 1 year ago

mmcdermott commented 3 years ago

[image]

In reality, 'μmol' is micromole and 'mmol' is millimole, so 1 mmol is 1000 μmol.

'umol' would also be an acceptable transliteration in this case.

avian2 commented 3 years ago

This won't be fixed in Unidecode, sorry.

"m" is a better ASCII transliteration for the letter mu when it is used in the Greek language. With context-free replacements, Unidecode can't distinguish whether the character is used as an SI prefix or as part of a Greek word.

Codepoints for SI units like ㎛ should already be transliterated as you suggest.
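
For callers who know their input contains scientific units, one possible workaround is to rewrite the micro prefix before transliterating. The following is a minimal sketch, not part of Unidecode: the function name `micro_to_u` and the unit list `_UNITS` are hypothetical and far from exhaustive. It replaces a mu (Greek letter U+03BC or micro sign U+00B5) with "u" only when it is not preceded by a letter and is directly followed by a known unit symbol.

```python
import re

# Hypothetical, non-exhaustive list of unit symbols that may carry the micro prefix.
_UNITS = ("mol", "m", "g", "L", "l", "s", "A", "V", "F", "W", "M")

# Longest symbols first, so "mol" is tried before "m".
_UNIT_ALTERNATION = "|".join(sorted(_UNITS, key=len, reverse=True))

# A mu (U+03BC) or micro sign (U+00B5) that
#   - is not preceded by a letter (so a mu inside a Greek word is left alone), and
#   - is directly followed by a known unit symbol ending at a word boundary.
_MICRO_PREFIX = re.compile(
    r"(?<![^\W\d_])[\u03bc\u00b5](?=(?:" + _UNIT_ALTERNATION + r")\b)"
)


def micro_to_u(text: str) -> str:
    """Replace a micro prefix with ASCII 'u'; leave every other character untouched."""
    return _MICRO_PREFIX.sub("u", text)
```

The result can then be passed to `unidecode()` as usual. Because this is a heuristic based on context, it can still misfire, for example on a Greek abbreviation that happens to look like a unit, so it is only suitable when the input is known to be scientific text.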