avian2 / unidecode

ASCII transliterations of Unicode text - GitHub mirror
https://pypi.python.org/pypi/Unidecode
GNU General Public License v2.0

Tironian "et" being changed to numeral, 7 #57

Closed by AdeDoyle 3 years ago

AdeDoyle commented 3 years ago

Hi.

I've noticed that the Unicode character "⁊", a Tironian note standing for Latin "et", is being changed to the numeral "7". While they look alike, this doesn't seem like the best way to decode it, as it is never used to represent a numeral.

In Old Irish and Latin texts it was used as shorthand for the full word "et", so perhaps it should be replaced with "et". Alternatively, it is still used as an ampersand in modern Irish, so maybe "&" would be a better replacement option?

Le meas, Adrian.

avian2 commented 3 years ago

Hi, thanks for your comment. I'm not familiar with the Irish language. Based on what you said, I'm more in favor of changing it to "&".

AdeDoyle commented 3 years ago

I think that would make the most sense in a modern context. I don't believe it continues to be used in languages other than Irish, except in variant abbreviations of the Latin expression, "et cetera", such as "⁊rl." and "ꝛc."

avian2 commented 3 years ago

Fixed. Thanks.
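
For anyone on a release that predates this fix, the following is a minimal sketch of a workaround, not the library's own implementation. It replaces the Tironian et (U+204A) with "&" before the text is transliterated. The names `TIRONIAN_ET`, `PRE_MAP` and `replace_tironian_et` are hypothetical and exist only for this illustration; the block uses the standard library alone, so the result would still need to be passed to `unidecode()` afterwards.

```python
import unicodedata

# Hypothetical constant for this sketch: the Tironian et, U+204A.
TIRONIAN_ET = "\u204a"

# Hypothetical pre-processing table mapping the Tironian et to an ampersand.
PRE_MAP = str.maketrans({TIRONIAN_ET: "&"})


def replace_tironian_et(text: str) -> str:
    """Replace every Tironian et with "&"; all other characters are left alone."""
    return text.translate(PRE_MAP)


# Confirm the code point is the character discussed in this issue.
print(unicodedata.name(TIRONIAN_ET))

# Non-ASCII letters are untouched here; only the Tironian et changes.
print(replace_tironian_et("Seán ⁊ Máire"))
```

Running the replacement first means the choice of "&" does not depend on which version of the transliteration table is installed.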