-
Hi,
first of all, thank you for providing this repo! I was able to set up speech recognition on my Jetson Nano 2GB relatively easily with it.
However, the quality of the prediction with the micropho…
-
Thanks for your greate job.I have tried this project in my own computer(win10, 1060ti 3gb), and I think the similarity of voice is good.Do you have any ideas of how to improve the qulity and similarit…
-
Hi I'm currently starting to learn the espnet framework, and found confused in LM. When reading the Branchformer paper, I found that they mentioned they were using the transformer LM downloaded from E…
-
### System Info
None
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### 🐛 Describe the bug
Hello,
Thank you for your excellent work on llm-asr.
I've been c…
-
I'm seeking guidance on how to incorporate a Phone-Based Language Lexicon (in icefall/egs/librispeech/ASR/prepare.sh in Step 6) into the latest Zipformer Model, a state-of-the-art solution in speech r…
-
Hello! I arrived here from your paper in InterSpeech and am very impressed by what you have been able to do with Speech Summarization!
I had a question involving the code. I am trying to train a m…
-
### 🐛 Describe the bug
When exporting the following model using the `dynamo_export`, there's an error about an `Unexpected type in sourceless builder`:
```py
import torch
from nemo.collections.asr.…
msis updated
3 months ago
-
Is there direct command line to call to convert text to phonemes? i don't want the alignments to the audio, just the phonemes. The use case is after training with a TTS model, in the inference time w…
-
FYI
I ran the transformer training with mmi + attention with different batch sampling settings to check which sampler gives better WER. The results are from the averaged model using the last 5 chec…
-
hi, thank you developed this great toolkit and open source.but i have some problem and cant solve by myself.
i run train.py in examples/conforme.After 20 epoches, loss reduce to 6.92。 but i get non…