irvin / voice-text-tools

Tools to modify sentences data of Common Voice project
MIT License
5 stars 0 forks source link

More languages #1

Open AndrewNLauder opened 6 years ago

AndrewNLauder commented 6 years ago

Hi Irvin,

How might someone add support for an additional language? Can this be used against the existing English common voice corpus? I'd be curious to see if the phonetic coverage is sufficient for training a model.

Thanks! Andrew

irvin commented 6 years ago

Hi @AndrewNLauder

I had little knowledge about English linguistics, which is different to Chinese in phoneme system.

In my imaginary, it should be much easier to estimate the phonetic coverage of English, because we don't need to "covert logograph character into syllable".

irvin commented 6 years ago

I think theoretically we need a syllables table of English. Need some linguistic support...

AndrewNLauder commented 6 years ago

Can you give an example? Isn’t this information available in Wikipedia?

Sent from my iPhone

On Aug 10, 2018, at 12:39 AM, Irvin notifications@github.com wrote:

I think theoretically we need a syllables table of English. Need some linguistic support...

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.