Open sixTiger opened 1 year ago
I ran into the same problem.
2023-09-07 19:45:00.950 | INFO | paddlespeech.s2t.modules.embedding:init:150 - max len: 5000
[2023-09-07 19:45:04,539] [ ERROR] - list index out of range
Traceback (most recent call last):
File "/Users/yangpeng/anaconda3/envs/python_3_9/lib/python3.9/site-packages/paddlespeech/cli/asr/infer.py", line 314, in infer
result_transcripts = self.model.decode(
File "/Users/yangpeng/anaconda3/envs/python_3_9/lib/python3.9/site-packages/decorator.py", line 232, in fun
return caller(func, *(extras + args), **kw)
File "/Users/yangpeng/anaconda3/envs/python_3_9/lib/python3.9/site-packages/paddle/fluid/dygraph/base.py", line 347, in _decorate_function
return func(*args, **kwargs)
File "/Users/yangpeng/anaconda3/envs/python_3_9/lib/python3.9/site-packages/paddlespeech/s2t/models/u2/u2.py", line 818, in decode
hyp = self.attention_rescoring(
File "/Users/yangpeng/anaconda3/envs/python_3_9/lib/python3.9/site-packages/paddlespeech/s2t/models/u2/u2.py", line 532, in attention_rescoring
assert speech.shape[0] == speech_lengths.shape[0]
IndexError: list index out of range
Traceback (most recent call last):
File "/Users/yangpeng/PyCharmProjects/Foo/foo/main.py", line 41, in
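Specifying conformer_wenetspeech as the model avoids the error. The default model appears to be conformer_u2pp_online_wenetspeech. Strangely, the command-line mode does not raise the error.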
from paddlespeech.cli.asr.infer import ASRExecutor
asr = ASRExecutor()
result = asr(audio_file="zh.wav", model="conformer_wenetspeech")
print(result)
Using model="conformer_online_wenetspeech" raises the same error, and conformer_wenetspeech does not work either. Environment: Ubuntu 20, paddlespeech 1.4.1.
@zxcd
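In my case, this error was caused by a bad sample-rate setting. I had set it to 24000 and then found that only 0, 8000, and 16000 are supported. At first I assumed my runtime environment was broken, so I tried all sorts of library version combinations and spent a long time before finding the root cause. Posting this for reference.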
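For anyone who wants to rule out the sample-rate cause before touching library versions, here is a minimal sketch that reads the rate from a WAV header using only the Python standard library. The helper names `wav_sample_rate` and `is_supported_rate` are made up for this example, and the accepted rates are an assumption taken from the comment above, not from the paddlespeech documentation. The value 0 from that comment is left out because the thread does not say what it means.

```python
import wave

# Assumption: rates reported as accepted in the comment above (0 omitted).
SUPPORTED_RATES = {8000, 16000}


def wav_sample_rate(path):
    """Return the sample rate (Hz) stored in the WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def is_supported_rate(path, supported=SUPPORTED_RATES):
    """Return True if the file's sample rate is in the supported set."""
    return wav_sample_rate(path) in supported
```

Calling `is_supported_rate("zh.wav")` before running the ASR call would show whether the input needs resampling first.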
Adding model="conformer_wenetspeech" still did not work for me. I was on paddlepaddle-gpu 2.5.2; after switching to 2.4.2 it ran normally.
This looks like a problem caused by paddlepaddle-gpu. After switching to version 2.4.2, the problem was resolved.
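Since several replies point to the installed Paddle version, a quick way to confirm what is actually in the environment may help. This is a minimal sketch using the standard library's `importlib.metadata`. The helper name `installed_version` is made up for this example, and the list of distribution names to probe is an assumption (the GPU and CPU builds are published under different names).

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(candidates):
    """Return (name, version) for the first installed distribution, else None."""
    for name in candidates:
        try:
            return name, version(name)
        except PackageNotFoundError:
            continue
    return None


# Assumption: these are the distribution names to probe for.
print(installed_version(["paddlepaddle-gpu", "paddlepaddle"]))
print(installed_version(["paddlespeech"]))
```

If this reports paddlepaddle-gpu 2.5.2, the replies above suggest trying 2.4.2, though the thread does not confirm that the version is the only cause.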
Running the example code fails immediately with an error.
The error message is as follows:
My environment configuration: