Some Chinese names have unnecessary spaces at the end when transliterating

avian2 / unidecode

ASCII transliterations of Unicode text - GitHub mirror

GNU General Public License v2.0

534 stars 62 forks source link

When trying to transliterate
"马云" I receive
"Ma Yun " (notice the space in the end) instead of
"Ma Yun"

Here's the code you can use to replicate this issue:

import unittest
import unidecode

class TestStrings(unittest.TestCase):
    def test_replace_non_ascii_letters_with_chinese_name(self):
        self.assertEquals(unidecode.unidecode("马云"), "Ma Yun")

The test fails with the following error:

AssertionError: 'Ma Yun ' != 'Ma Yun'
- Ma Yun 
?       -
+ Ma Yun

Run on Python 3.8.5

EDIT:

Google Translate seems to be doing this with no issue, but perhaps Google Translate has the faulty transliteration. Chinese speakers welcome to correct me.

avian2 / unidecode

Some Chinese names have unnecessary spaces at the end when transliterating #64