modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
https://www.funasr.com
Other
6.96k stars 739 forks source link

websocket service cannot do real-time recognition after deployment with docker #2093

Closed ruifengma closed 1 month ago

ruifengma commented 1 month ago

Notice: In order to resolve issues more efficiently, please raise issue following the template. (注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

Code

nohup bash run_server.sh \
  --download-model-dir /workspace/models \
  --vad-dir damo/speech_fsmn_vad_zh-cn-16k-common-onnx \
  --model-dir damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-onnx  \
  --punc-dir damo/punc_ct-transformer_cn-en-common-vocab471067-large-onnx \
  --lm-dir damo/speech_ngram_lm_zh-cn-ai-wesp-fst \
  --itn-dir thuduj12/fst_itn_zh \
  --hotword /workspace/models/hotwords.txt \
  --certfile 0 > log.txt 2>&1 &

What have you tried?

I have deploy funasr successfully and tested with the client which can be used for audio recognition, but when I use the example html (index.html) and tried to connect my service, it cannot really react as real time recognition, but only reply me after I click the stop button (which is not the same as the demo from official website)

What's your environment?

LauraGPT commented 1 month ago

Sorry, the realtime service does not support English so far. We would support it soon.

ruifengma commented 1 month ago

Sorry, the realtime service does not support English so far. We would support it soon.

Thanks for the reply, is Chinese supported by realtime websocket service? How could I specify the parameter? Thanks