-
Hello, I would like to ask why the choice of glove embeddings is Common Crawl and the choice of agwe embeddings is librispeech in the code. Shouldn't the choice of glove embeddings also be librispeech
-
Hi there, I am sorry, but in the paper, there is no detail about the features S and T? which acoustic features and which word embeddings have you used there? Precisely what is the shape of the feature…
-
Task :
Create an offline alternative to Google's [read along app ](https://readalong.google.com/) in Hindi. It should be able to show a set of words and be able to determine if you have spoken the …
-
Hi @radekosmulski , I figured I'd open a new issue to discuss the paper itself so we can keep using #6 for your updates only.
Forgive me in advance if some of my questions have already been discus…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
Hi. I apologize if this is not the right place to post a question, but I wasn't sure where to post it.
I have a few questions about JoinAP from Zhu et al 2021 (https://arxiv.org/pdf/2107.05038.pdf)…
-
I dont know if this is a bug or just some kind of usage error but i have tried both pocketsphyx and prorcupine as wake word providers. Both work and have good recognition rates but the recognition aft…
-
Hi how can learn model new words that don't see it.
How can learn model a new pronunciation and how can deal with crossover pronunciation of two neighbor words?
In HMM-based we can generate phones o…
-
Let’s face it. KenLM has served us well…
…but it has its limitations. It didn’t aged well as a language model architecture.
First order of business is to compute a bi directional vector representa…
-
Hi, In **NATSpeech**, inappropriate dependency versioning constraints can cause risks.
Below are the dependencies and version constraints that the project is using
```
matplotlib
librosa==0.8.…