-
I've tried generating alignments for a `pruned_transducer_stateless7` model using https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/pruned_transducer_stateless7/compute_ali.py. Looking…
-
Thanks for your great work!
I want to try to train a audio encoder with CLAP and then use this encoder and ctc to fintune ASR datasets.
May I ask if this method is feasible, or if there are better…
wntg updated
10 months ago
-
Thank you very much for sharing the code for this work! However, in the attack_utils.py
`from data_utils import wav2mel_tensor, Transform`,
I meet the error that cannot find reference `Transform` i…
-
https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_vad_punc_example.wav
https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_zh.wav
这两个音频为什么…
-
Hi @Masao-Someki,
In the readme the example for streaming asr shows the use of start() and end() methods:
```
from espnet_onnx import StreamingSpeech2Text
stream_asr = StreamingSpeech2Text(tag…
-
Hi Thanks for great speech tool kit.
I am using the Librispeech ASR recipe for training with my custom data. Due to CPU memory limitations, I am utilizing "num_splits_asr." Additionally, because of…
-
Hi, thank you for publishing such a great work!
I just want to make sure whether the customized My_WavVec2CTCTokenizer is a phoneme-level tokenizer, which contains only phoneme inventory. In the file…
-
Notice: In order to resolve issues more efficiently, please raise issue following the template.
(注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)
## ❓ Questions and Help
SenseVoice 是否支持实时的ASR?
### Before aski…
-
## Description
The transcript will be `en` for English words and `hi` for Hindi words. Using Whisper/Fairseq. Also, an alternate model that gives transcript in 100% `hi` or 100% `en`.
## Why
1.…
-
## ❓ Questions and Help
#### What is your question?
I'm facing the following issue while running the MMS ASR inference script `examples/mms/asr/infer/mms_infer.py`:
```
File "/workspace/fa…