gooofy / zamia-speech

Open tools and data for cloudless automatic speech recognition
GNU Lesser General Public License v3.0
443 stars 84 forks source link

Export every form from lexicon #75

Closed pguyot closed 4 years ago

pguyot commented 5 years ago

Kaldi is supposed to handle alternate pronunciations in lexicon.

I believe there was a bug in speech_kaldi_export.py where only the first form was selected, ignoring alternate pronunciation. With current dict-fr.ipa, the first pronunciation is often not the most frequent.

pguyot commented 5 years ago

The same bug occurs in adapt script.