mpuels closed 6 years ago
Hi Marc,
ai-sentences.txt is the bridge between my zamia-ai and speech projects. The idea is to dump all sentences accepted by zamia-ai's model so that the language models for Sphinx/Kaldi are sure to cover them. And yes, this is completely optional: you can leave ai-sentences.txt empty, or put into it any other sentences you want to make sure are covered.
Cheers,
Guenter
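The idea above can be illustrated with a minimal sketch. It is not the actual speech_build_lm.py code: the function names read_sentences and collect_lm_sentences are hypothetical, the directory layout is taken from the 'data/dst/speech/%s/ai-sentences.txt' pattern quoted in this thread, and treating a missing ai-sentences.txt the same as an empty one is an assumption made for illustration only.

```python
from pathlib import Path


def read_sentences(path):
    """Return the non-empty, stripped lines of a file, or [] if it does not exist."""
    p = Path(path)
    if not p.is_file():
        return []
    with p.open(encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


def collect_lm_sentences(lang, base="data/dst/speech"):
    """Merge the main corpus with the optional extra sentences, dropping duplicates.

    Hypothetical helper: the paths follow the pattern quoted in this thread,
    everything else is an assumption.
    """
    main = read_sentences(Path(base) / lang / "sentences.txt")
    # Optional file: may be empty (or, in this sketch, missing altogether).
    extra = read_sentences(Path(base) / lang / "ai-sentences.txt")

    seen = set()
    merged = []
    for sentence in main + extra:
        if sentence not in seen:
            seen.add(sentence)
            merged.append(sentence)
    return merged
```

Calling collect_lm_sentences("de") would then return the sentences from sentences.txt followed by any additional ones from ai-sentences.txt, in order and without duplicates, so the extra file only ever adds coverage.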
Hi Guenter,
I'm trying to get your scripts running and have a question regarding the training of the German language model. I've run two of the scripts: one writes speech/data/dst/speech/de/punkt.pickle and the other writes speech/data/dst/speech/de/sentences.txt. So far so good. Now I'd like to run speech_build_lm.py, but according to its source it also needs the file 'data/dst/speech/%s/ai-sentences.txt', and running the command did not succeed.
So the question is: what does ai-sentences.txt contain, and how do I create it? To train the language model, ai-sentences.txt is not necessary, because we have sentences.txt. But I'd like to know where ai-sentences.txt comes from :smile:
Thanks for your help in advance!
Cheers, Marc