modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
https://www.funasr.com
Other
6.48k stars 688 forks source link

离线paraformer推理报错 #1432

Closed dospeech closed 7 months ago

dospeech commented 7 months ago

Help

3.5号刚git clone的新代码,运行infer.shhou 报错,不知道什么原因,情况如下:

root@9df7eea98999:/workspace/20240305_funasr/FunASR/examples/industrial_data_pretraining/paraformer# ./infer.sh

If you want to use hugging, please pip install -U transformers /usr/local/lib/python3.8/runpy.py:127: RuntimeWarning: 'funasr.bin.inference' found in sys.modules after import of package 'funasr.bin', but prior to execution of 'funasr.bin.inference'; this may result in unpredictable behaviour warn(RuntimeWarning(msg)) [2024-03-05 13:20:05,566][root][INFO] - download models from model hub: ms [2024-03-05 13:20:11,621][root][INFO] - Loading pretrained params from exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/model.pt ckpt: exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/model.pt Error executing job with overrides: ['++model=exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch', '++model_revision=v2.0.4', '++input=exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/example/asr_example.wav', '++output_dir=./outputs/debug', '++device=cuda:0'] Traceback (most recent call last): File "/workspace/20240305_funasr/FunASR/funasr/bin/inference.py", line 25, in main_hydra model = AutoModel(kwargs) File "/workspace/20240305_funasr/FunASR/funasr/auto/auto_model.py", line 102, in init model, kwargs = self.build_model(kwargs) File "/workspace/20240305_funasr/FunASR/funasr/auto/auto_model.py", line 191, in build_model load_pretrained_model( File "/workspace/20240305_funasr/FunASR/funasr/train_utils/load_pretrained_model.py", line 68, in load_pretrained_model src_state = torch.load(path, map_location=map_location) File "/usr/local/lib/python3.8/site-packages/torch/serialization.py", line 795, in load return _legacy_load(opened_file, map_location, pickle_module, pickle_load_args) File "/usr/local/lib/python3.8/site-packages/torch/serialization.py", line 1002, in _legacy_load magic_number = pickle_module.load(f, pickle_load_args) _pickle.UnpicklingError: invalid load key, 'v'.

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.

#########. infer.sh Code 如下: input="exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/example/asr_example.wav"

output_dir="./outputs/debug"

model="exp/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch"

model="exp/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch" model_revision="v2.0.4"

device="cuda:0" # "cuda:0" for gpu0, "cuda:1" for gpu1, "cpu"

python -m funasr.bin.inference \ ++model=${model} \ ++model_revision=${model_revision} \ ++input="${input}" \ ++output_dir="${output_dir}" \ ++device="${device}" \

What have you tried?

只改变音频输入路径

What's your environment?

LauraGPT commented 7 months ago

Please check your model path.