Closed jingsupo closed 3 weeks ago
能够复现吗
可以啊,每一次都出这个错,会不会是我的portaudio相关依赖的问题?关键是这个错误提示看不出问题出在哪啊?
非Windows能复现吗
非Windows没试过
我在阿里云的服务器上测试了Linux环境,由于没有音频设备,报了下面的错误
2024/04/06 12:52:21.578184 Failed to get default input device: %v
no default input device
不过我在阿里云的服务器上测试了non-streaming-decode-files,这个直接进行音频的语音识别的,所以运行正常
[root@iZ2zeatc350vkqmd1l53tjZ non-streaming-decode-files]# ./run-paraformer.sh
2024/04/06 12:56:21.554292 Reading ./sherpa-onnx-paraformer-trilingual-zh-cantonese-en/test_wavs/3-sichuan.wav
2024/04/06 12:56:21.571568 Initializing recognizer (may take several seconds)
2024/04/06 12:56:26.031247 Recognizer created!
2024/04/06 12:56:26.031276 Start decoding!
/project/sherpa-onnx/csrc/offline-paraformer-greedy-search-decoder.cc:Decode:65 time stamp for batch: 0, 40 vs -1
2024/04/06 12:56:26.626338 Decoding done!
2024/04/06 12:56:26.626369 自己就是在那个在那个就是在情节里面就是感觉是演得特别好就是好像很真实一样你知道吧
2024/04/06 12:56:26.626392 Wave duration: 7.835 seconds
那Windows上可以跑这个识别文件的例子么
今天又重新编译了,可以运行成功~
go version go1.22.2 windows/amd64
D:\a\sherpa-onnx\sherpa-onnx\sherpa-onnx\c-api\c-api.cc:SherpaOnnxCreateVoiceActivityDetector:812 VadModelConfig(silero_vad=SilerVadModelConfig(model="./silero_vad.onnx", threshold=0.5, min_silence_duration=0.5, min_speech_duration=0.25, window_size=512), sample_rate=16000, num_threads=1, provider="cpu", debug=True)
D:\a\sherpa-onnx\sherpa-onnx\sherpa-onnx\c-api\c-api.cc:CreateOfflineRecognizer:398 OfflineRecognizerConfig(feat_config=FeatureExtractorConfig(sampling_rate=16000, feature_dim=80, low_freq=20, high_freq=-400, dither=0), model_config=OfflineModelConfig(transducer=OfflineTransducerModelConfig(encoder_filename="", decoder_filename="", joiner_filename=""), paraformer=OfflineParaformerModelConfig(model="./sherpa-onnx-paraformer-trilingual-zh-cantonese-en/model.int8.onnx"), nemo_ctc=OfflineNemoEncDecCtcModelConfig(model=""), whisper=OfflineWhisperModelConfig(encoder="", decoder="", language="", task="transcribe", tail_paddings=-1), tdnn=OfflineTdnnModelConfig(model=""), zipformer_ctc=OfflineZipformerCtcModelConfig(model=""), wenet_ctc=OfflineWenetCtcModelConfig(model=""), telespeech_ctc="", tokens="./sherpa-onnx-paraformer-trilingual-zh-cantonese-en/tokens.txt", num_threads=2, debug=True, provider="cpu", model_type="", modeling_unit="", bpe_vocab=""), lm_config=OfflineLMConfig(model="", scale=1), ctc_fst_decoder_config=OfflineCtcFstDecoderConfig(graph="", max_active=3000), decoding_method="greedy_search", max_active_paths=4, hotwords_file="", hotwords_score=1.5, blank_penalty=0, rule_fsts="", rule_fars="")
2024/07/02 15:42:21.264935 Selected default input device: 麦克风阵列 (Realtek High Definition
2024/07/02 15:42:21.347389 Started! Please speak
2024/07/02 15:42:52.563583 Detected speech
2024/07/02 15:42:54.272830 Duration: 1.51 seconds
2024/07/02 15:42:54.348905 今天天气不错
2024/07/02 15:42:54.349189 Saved to seg-0-1.51-seconds-今天天气不错.wav
2024/07/02 15:42:54.349711 ----------
太棒啦!可以关闭了么?
我来关闭吧^^
Example: sherpa-onnx/go-api-examples/vad-asr-paraformer
go version go1.22.0 windows/amd64
bug info: