Generating subtitle files (.srt) from audio files (.wav or .mp3): is recognition of English, German, Japanese, etc. supported?

lvsh2012 commented 1 year ago

C:\python\PaddleSpeech\demos\streaming_asr_server> paddlespeech_server start --config_file ./conf/ws_conformer_wenetspeech_application.yaml

c:\program files\python38\lib\site-packages\paddleaudio_extension.py:141: UserWarning: paddleaudio C++ extension is not available. sox_io, sox_effect, kaldi raw feature is not supported!!!
  warnings.warn(
[2023-07-19 09:55:09,412] [ INFO] - start to init the engine
[2023-07-19 09:55:09,412] [ INFO] - asr : online engine.
2023-07-19 09:55:09.561 | INFO | paddlespeech.s2t.modules.ctc::45 - paddlespeech_ctcdecoders not installed!
[2023-07-19 09:55:09,568] [ ERROR] - Failed to start server.
[2023-07-19 09:55:09,568] [ ERROR] - Can't find "conformer_online_wenetspeech-en-16k" in resource. Model name must be one of ['conformer_online_wenetspeech-zh-16k', 'conformer_u2pp_online_wenetspeech-zh-16k', 'conformer_online_multicn-zh-16k', 'conformer_online_aishell-zh-16k', 'deepspeech2online_wenetspeech-zh-16k', 'deepspeech2online_aishell-zh-16k']
PS C:\python\PaddleSpeech\demos\streaming_asr_server>

Is only Chinese supported at the moment? Are there recognition models for other languages?
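For readers hitting the same error, here is a minimal sketch of why the lookup fails. It assumes the resource name is built as `<model_type>-<lang>-<rate>k`, which is inferred only from the names in the error message above and is not confirmed PaddleSpeech behavior. The helper names `resource_tag` and `languages_for` are hypothetical, and the model list is copied verbatim from the log.

```python
# Model names copied from the error message in the log above.
SUPPORTED_STREAMING_MODELS = [
    "conformer_online_wenetspeech-zh-16k",
    "conformer_u2pp_online_wenetspeech-zh-16k",
    "conformer_online_multicn-zh-16k",
    "conformer_online_aishell-zh-16k",
    "deepspeech2online_wenetspeech-zh-16k",
    "deepspeech2online_aishell-zh-16k",
]


def resource_tag(model_type: str, lang: str, sample_rate: int) -> str:
    """Build a name shaped like the ones in the log (assumed naming scheme)."""
    return f"{model_type}-{lang}-{sample_rate // 1000}k"


def languages_for(model_type: str) -> list:
    """List the language codes that appear for a model type in the list."""
    langs = set()
    for name in SUPPORTED_STREAMING_MODELS:
        # Split from the right: "<model_type>-<lang>-<rate>k"
        model, lang, _rate = name.rsplit("-", 2)
        if model == model_type:
            langs.add(lang)
    return sorted(langs)


requested = resource_tag("conformer_online_wenetspeech", "en", 16000)
if requested not in SUPPORTED_STREAMING_MODELS:
    available = languages_for("conformer_online_wenetspeech")
    print(f"{requested} is not available; languages for this model: {available}")
```

Under that assumption, every streaming model named in the error message is a `zh` model, so asking for `en` with the same model type cannot resolve.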

zxcd commented 1 year ago

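The server side currently also has a model that supports both Chinese and English. If you do not need the server, you can try the whisper model, which supports more languages but cannot run in real time. For the Chinese-English model, see: https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos/speech_recognition For the whisper model, see: https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos/whisper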
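On the .srt part of the question: whichever model produces the transcript, writing the subtitle file is a separate step. The sketch below assumes you already have a list of `(start_seconds, end_seconds, text)` segments from some recognizer. That input shape is an assumption made for illustration, not the output format of PaddleSpeech or whisper, and the function names are hypothetical.

```python
from typing import Iterable, Tuple

Segment = Tuple[float, float, str]  # (start_seconds, end_seconds, text), assumed shape


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as SRT text: index, time range, caption, blank line."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_range}\n{text.strip()}\n")
    # Joining with a newline leaves one blank line between cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [(0.0, 1.5, "Hello"), (1.5, 3.0, "World")]
    with open("demo.srt", "w", encoding="utf-8") as f:
        f.write(segments_to_srt(demo))
```

Writing the file as UTF-8 keeps non-ASCII captions such as German or Japanese text intact.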

stale[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

PaddlePaddle / PaddleSpeech

Generating subtitle files (.srt) from audio files (.wav or .mp3): is recognition of English, German, Japanese, etc. supported? #3400
