interscript / maps

Script conversion maps for Interscript
2 stars 1 forks source link

Implement system `alalc-zho-hani-latn-1997` (ALA-LC Romanization Table -- Chinese (1997) #100

Open ronaldtse opened 4 years ago

ronaldtse commented 4 years ago

This issue is to implement the transliteration system of alalc-zho-hani-latn-1997.

This system is referred in the GeoNames database as zho_Hani2Latn_ALA_1997, with the system title 'ALA-LC Romanization Table -- Chinese (1997)'.

Tests should rely on the data extracted for the zho_Hani2Latn_ALA_1997 system in https://github.com/riboseinc/geonames-transliteration-data .

ronaldtse commented 4 years ago

System described here: http://catdir.loc.gov/catdir/cpso/romanization/chinese.pdf

ronaldtse commented 4 years ago

This system is basically an alias to pinyin, but the capitalization rules are complex and some are rather arbitrarily applied. Not sure if we can actually do that even with good ML.