dmort27 / epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
MIT License
630 stars 121 forks source link

Fix Wu #115

Closed kalvinchang closed 2 years ago

kalvinchang commented 2 years ago

Continuation of #112 Note - for words made of 2+ characters, Wiktionary will assign the tone to the first character for left tone sandhi. This tone indicates the tonal category that dictates the tone sandhi, as explained in https://en.wiktionary.org/wiki/Wiktionary:About_Chinese/Wu. Right tone sandhi puts the tone on each character and joins the characters with a +

For example, Left tone sandhi: 1'la mi Right tone sandhi: 3non+2hau

what is a good solution for handling the tone sandhi?

for now, since our use case is to transcribe monosyllabic words (characters themselves), we will ignore this issue.