cmusphinx / sphinx4

Pure Java speech recognition library
cmusphinx.sourceforge.net
Other
1.4k stars 586 forks source link

Updated the en-us dict #83

Closed Coeur closed 6 years ago

Coeur commented 6 years ago

This updates the dictionary.

The script used is visible at https://github.com/Coeur/cmudict/blob/no-stress/stripStress.swift.

1) strips comments starting with # 2) strips digits 0, 1, 2 from phones 3) strips duplicates 4) re-index

Original order of entries is unaltered.

Coeur commented 6 years ago

If that makes it easier to review, it's the same file as https://github.com/cmusphinx/pocketsphinx/pull/124

nshmyrev commented 6 years ago

Thank you!