-
Is there any way to specify a speech recognition grammar? I am sure that deepspeech will work better, if it is used with a grammar. Am I missing something or is this not yet implemented?
A possible…
-
When calculating the speech features for the speech2text models, OpenSeq2Seq calculates a mean and stddev individually for each training sample. Much like batch normalization, during inference, it wou…
-
Subscribe to this issue and stay notified about new [daily trending repos in all languages](https://github.com/trending?since=daily)!
-
### System Info
Is there a parameter someplace for audio pipeline support for initial_prompt?
like this: https://github.com/openai/whisper/discussions/963
$ whisper --help
optional arguments:
…
-
**Is your enhancement request related to a problem? Please describe.**
Not all humans know the same language.
**Describe the solution you'd like**
I'd like to have the auto-gene…
-
Can someone share a voice sample he created with this repository based on a given and/or a custom set of voice files
-
The following is the error code I got:
Collecting jamo
Using cached jamo-0.4.0.tar.gz (7.3 kB)
Preparing metadata (setup.py) ... error
error: subprocess-exited-with-error
× python s…
-
We've been using pocketsphinx.js successfully for some time.
We've recently discovered (in Google chrome) that the accuracy is very bad on computers with comparatively less CPU, while it works per…
-
The documentation says
> Once this is working properly you may wish to download one of the larger language models for more accurate dictation. They are available [here](https://alphacephei.com/vos…
-
I am trying to use the inference boosted by LM technique for ASR, but I am getting an error. Can you please help? I follow the guidelines as described here: https://github.com/jonatasgrosman/huggingso…