mozilla / DSAlign

DeepSpeech based forced alignment tool
Mozilla Public License 2.0

Feature/deepspeech 0.6.0 #15

BoneGoat closed 4 years ago

BoneGoat commented 4 years ago

Tested with the following params:

--output-max-cer 15 --loglevel 10 --audio data/test1/audio.wav --script data/test1/transcript.txt --aligned data/test1/aligned.json --tlog data/test1/log.txt --stt-model-dir models/en/deepspeech-0.6.0-models --alphabet models/en/alphabet.txt --stt-no-own-lm

After the fix for the missing alphabet.txt, I tested with the following params:

--output-max-cer 15 --loglevel 10 --audio data/test1/audio.wav --script data/test1/transcript.txt --aligned data/test1/aligned.json --tlog data/test1/log.txt --stt-no-own-lm

I still need to figure out how to solve the following error when running without --stt-no-own-lm:

Error: Trie file version mismatch (4 instead of expected 5). Update your trie file.

Got past this error by using v0.6.0 of the native client from GitHub, extracted into dependencies/deepspeech. taskcluster.py is supposed to fix this, but taskcluster.net seems to be deprecated.

tilmankamp commented 4 years ago

I tested your branch and all seems OK. You first have to remove all the old (0.5.x) model files by running rm -rf models in your project root, and then call bin/getmodels.sh again.
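The two cleanup steps above could be wrapped roughly as follows. This is only a sketch: refresh_models is a hypothetical helper name that is not part of DSAlign, and it assumes bin/getmodels.sh is meant to be run from the project root, as described above.

```shell
#!/usr/bin/env bash
# Sketch: drop stale 0.5.x model files, then re-fetch the models.
# refresh_models is a hypothetical helper, not something DSAlign ships.
refresh_models() {
    local root="${1:-.}"   # project root; defaults to the current directory

    # Bail out before deleting anything if the download script is missing.
    if [ ! -f "$root/bin/getmodels.sh" ]; then
        echo "bin/getmodels.sh not found under $root" >&2
        return 1
    fi

    # Use a subshell so the caller's working directory stays unchanged.
    (
        cd "$root" || exit 1
        rm -rf models             # remove the old (0.5.x) model files
        bash bin/getmodels.sh     # download the current model archive
    )
}
```

Calling refresh_models with the path to the checkout (or with no argument from inside the project root) performs both steps. Checking for the script first means the models directory is not deleted if it cannot be repopulated.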

There is just one (new) caveat with the current model archive: it is missing the English alphabet.txt file. So getmodels.sh should be extended to fetch it from here.