-
Hi,
Is there anything blocking the use of openfst 1.8.1? I have an application that relies on that version (it's the latest), but Kaldi is still using 1.7.2, and mixing openfst versions can be qui…
-
When Saka Key is enabled, this page https://www.elster.de/eportal/start is missing the menu on the left side. When Saka Key is disabled, the page renders properly.
The menu is as follows:
ELSTER…
ghost updated
5 years ago
-
Hi!
Is it possible apply Eesen for Handwriting Recognition decoding using run_ctc_char.sh? I have a handwriting recognition model trained with the CTC objective function that I use to generate post…
-
http://www.worldofspectrum.org/forums/discussion/comment/855032/#Comment_855032
Find an FOSS voice recognition that we can use
-
The message from an FST binary doesn't clearly point to the cause. An easy fix, I'll do.
The tool also trusts the index value of `#0` being 1 larger than the last phone symbol, which better be chec…
-
#echo skipped
#--dpkg-shlibdeps-params=--ignore-missing-info
debian/rules binary
dh binary --with python2,python3 --buildsystem=pybuild
dh_testroot -O--buildsystem=pybuild
dh_prep -O--buil…
-
It would be most useful if we can train the system to differentiate who said something. Depending on the person we could then start or ignore a command. For instance:
- a guest in the house can't r…
-
I am using the websocket server docker image for the english model. I am feeding it a live stream of converted (to wav) audio for telephony purposes. I have noticed that the websocket returns parsed t…
-
[I hit send too soon on this; I'm updating the comment.]
I think the time might have come to create an 's5b' version of the WSJ setup.
WSJ is the oldest setup and the local scripts are not up to t…
-
The utterances in the TEDLIUM dataset roughly range from 8 to 15 seconds.
I have a dataset with shorter utterances, ~5 to 10 seconds long.
What are the optimal and minimum lengths of utterances …