-
推理时,使用文档中的语句:
```bash
python inference.py --asr hubert --dataset ./your_data_dir/ --audio_feat your_test_audio_hu.npy --save_path xxx.mp4 --checkpoint your_trained_ckpt.pth
```
![image](https://…
-
I have finetuned xlsr-128 pretrained model on my own dataset using fairseq toolkit. After that I wanted 'checkpoint_best.pt' to be used as a frontend for CTC/attention based architechture for asr task…
-
## MRE
```f90
program array_operations
implicit none
! Declare variables
integer, dimension(5) :: array
real :: scalar_value
integer :: i
real, dimension(5) :: x,…
-
实现数字人语音对话搞这两个html文件?
-
Hello! Great plugin. I've noticed that the getting started and how to use documentation is a little on the sparse side. Would be super if it were fleshed out more, especially for less savvy folks. As …
-
**How can we improve our ASR model to reliably output an empty string for unintelligible speech in noisy environments?**
We have trained an ASR model on a Hindi-English mixed dataset comprising app…
-
The f2023 standard C805 says this program is not valid because ```linelen``` is not a specification expression. But lfortran 0.41.0 compiles it and prints '?'
```
integer :: linelen =…
-
In stage 1, only ASR and TTS is used.
ASR is Audio -> Text, so loss is only calculated for text tokens, not for audio tokens right?
TTS is Text -> Audio, but mini-omni outputs text and audio sim…
-
what is the asr tool?
-
Oomptimizer requires a standalone config for evaluation. This doesn't take advantage of hydrarunner as used in other ASR workflows. Would make life easier to allow passing hydrarunner changes.