Streaming ASR service: in a fairly quiet environment, after about 2 seconds with no incoming audio, speaking again returns new content

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

https://paddlespeech.readthedocs.io

Apache License 2.0

10.55k stars 1.81k forks

Streaming ASR service: in a fairly quiet environment, after about 2 seconds with no incoming audio, speaking again returns new content #3254

Open duxb opened 1 year ago

duxb commented 1 year ago

General Question

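With the streaming ASR service in a fairly quiet environment, if no audio arrives for roughly 2 seconds, speaking again makes the service return new content. After the end command is sent, the returned result contains only the last stretch of speech; the content recognized before it is not returned. How can this silence duration be configured, or is there another way to solve this? Also, where can I find documentation for the parameters in application.yaml?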

duxb commented 1 year ago

This happens with the conformer model; it does not happen with the deepspeech2 model.

zh794390558 commented 1 year ago

Try configuring VAD.
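For readers unfamiliar with the idea, here is a minimal sketch of what silence-based segmentation with a VAD does. It is not PaddleSpeech's VAD and says nothing about its configuration keys: the function name `split_on_silence`, the RMS energy threshold, and the 2-second default for `max_silence_s` are all assumptions made for illustration. Samples are assumed to be floats in the range [-1, 1].

```python
import math


def split_on_silence(samples, sample_rate=16000, frame_ms=20,
                     energy_threshold=0.01, max_silence_s=2.0):
    """Return (start, end) sample indices of speech segments.

    Hypothetical energy-based VAD: a frame counts as speech when its RMS
    is at or above `energy_threshold`. An open segment is closed once more
    than `max_silence_s` seconds of consecutive silent frames follow it.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    max_silent_frames = int(max_silence_s * 1000 / frame_ms)
    segments = []
    start = None          # first sample of the open segment, if any
    last_voiced_end = 0   # end index of the most recent speech frame
    silent_run = 0        # consecutive silent frames inside the open segment

    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        if rms >= energy_threshold:
            if start is None:
                start = i
            last_voiced_end = i + frame_len
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run > max_silent_frames:
                # Silence lasted too long: close the segment at the last speech frame.
                segments.append((start, last_voiced_end))
                start = None
                silent_run = 0

    if start is not None:
        segments.append((start, last_voiced_end))
    return segments
```

With segment boundaries decided explicitly like this, a client could keep the text of each closed segment itself instead of relying on the single result returned after the end command. Whether the server-side VAD in PaddleSpeech exposes an equivalent silence-duration setting is not stated in this thread.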

NLPerxue commented 1 year ago

Does the input have to be audio, or can raw binary bytes be sent instead?

stale[bot] commented 8 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.