Fsmn_vad参数speech_to_sil_time_thres

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

https://www.funasr.com

Other

7.1k stars 754 forks source link

Fsmn_vad参数speech_to_sil_time_thres #1973

Open tesfayetong opened 4 months ago

tesfayetong commented 4 months ago

❓ Questions and Help

为啥speech_to_sil_time_thres设置的越大，切割出的音频会越多呢？

tesfayetong commented 3 months ago

这个参数的意思是声音到静音的最长时间阈值么？为什么我设置的越小，反而切分出的结果也越少了呢，照理说阈值减小了应该越多？