-
Is it possible to alter the properties of the `kaldinnet2onlinedecoder` GStreamer element while it is in the pipeline? I'm trying to alter the server so I can send a request to change which FST, model e…
-
**Describe the bug**
When `ctc_weight=1.0`, the `ESPnetASRModel` should not have a decoder in its parameters, regardless of the decoder config. However, it currently adds a default RNN decoder.
Note t…
-
### Describe the issue
I am currently using Whisper for ASR, which is a tiny/base multi-language model generated by Olive. It works fine most of the time, but when I feed Chinese audio to Whisper, …
-
### Question
The lexicon-based beam search decoder currently has a fixed lexicon and thus a closed set of words that an ASR model can recognise. Is there a way to input a list of additional words/phr…
-
Hi, would someone mind taking a look at my setup? Tried multiple different approaches including trying to pass a hard coded string path, but I always get `'_NamespacePath' object is not subscriptable.…
-
## Paper title (as in the original)
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
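## In a nutshell
Proposes a new benchmark for speech processing using discrete units and introduces a challenge that evaluates its effectiveness on three tasks: multilingual ASR, TTS, and singing voice synthesis.
### Paper link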
[arXiv:2406.…
-
First, I changed the ASR-related configuration items:
#funasr / ali
ASR_mode=funasr
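#Choose one of the two ASR options (requires running the fay/test/funasr service). Integrates the DAMO Academy ASR project; thanks to 张聪聪 (Zhang Congcong), algorithm engineer at 中科大脑, for providing the integration code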
local_asr_ip=0.0.0.0
local_asr_port=10197
funasr was installed successfully and started with the following command:
python -u ASR_ser…
-
Notice: In order to resolve issues more efficiently, please raise issues following the template and include the relevant details.
## ❓ Questions and Help
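How do I configure the speaker recognition module for the Chinese offline transcription service (CPU)? So far I have not seen any speaker…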
-
In beam search, each result has scores (ctc, lm, ngram). Is this a cumulative probability? I ask because there is only one number. If I want to know the probability of each token, what should I do? Further …
-
I'm trying to transcribe audio with a **pre-trained** model as shown in `Streaming-ASR.ipynb` and `demo_streaming_asr.py`. I have changed the `MODEL_PARAMS` in `frame_asr.py` as shown below (config …