FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model
https://funaudiollm.github.io/
Other
2.61k stars 249 forks

KeyError: 94 #119

Closed wangjiancheng-123 closed 1 week ago

wangjiancheng-123 commented 1 week ago

🐛 Bug

  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/hydra/_internal/utils.py", line 220, in run_and_report
    return func()
  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/hydra/_internal/utils.py", line 458, in <lambda>
    lambda: hydra.run(
  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/hydra/_internal/hydra.py", line 132, in run
    _ = ret.return_value
  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/hydra/core/utils.py", line 260, in return_value
    raise self._return_value
  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/hydra/core/utils.py", line 186, in run_job
    ret.return_value = task_function(task_cfg)
  File "funasr/bin/train.py", line 53, in main_hydra
    main(kwargs)
  File "funasr/bin/train.py", line 212, in main
    trainer.train_epoch(
  File "/mnt/d/workspace/SenseVoice-main/FunASR-main/funasr/bin/funasr/train_utils/trainer.py", line 382, in train_epoch
    retval = model(batch)
  File "/home/wjc/anaconda3/envs/cosyvoice/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/d/workspace/SenseVoice-main/FunASR-main/funasr/bin/funasr/models/sense_voice/model.py", line 689, in forward
    encoder_out, encoder_out_lens = self.encode(speech, speech_lengths, text)
  File "/mnt/d/workspace/SenseVoice-main/FunASR-main/funasr/bin/funasr/models/sense_voice/model.py", line 751, in encode
    [[self.textnorm_int_dict[int(style)]] for style in text[:, 3]]
  File "/mnt/d/workspace/SenseVoice-main/FunASR-main/funasr/bin/funasr/models/sense_voice/model.py", line 751, in <listcomp>
    [[self.textnorm_int_dict[int(style)]] for style in text[:, 3]]
KeyError: 94

Data format:
{"key": "Tenor-7_xianggelila_0024", "text_language": "<|zn|>", "emo_target": "<|EMO_UNKNOWN|>", "event_target": "<|Speech|>", "with_or_wo_itn": "<|withitn|>", "target": "你抱着小猫咪蓝眼睛不再忧郁香格里拉在哪里", "source": "song_data/Tenor-7/Tenor-7_xianggelila_0024.wav", "target_len": 20, "source_len": 400}
{"key": "Tenor-7_xianggelila_0025", "text_language": "<|zn|>", "emo_target": "<|EMO_UNKNOWN|>", "event_target": "<|Speech|>", "with_or_wo_itn": "<|withitn|>", "target": "让我们去找寻", "source": "song_data/Tenor-7/Tenor-7_xianggelila_0025.wav", "target_len": 6, "source_len": 120}
Is this format not correct?
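
For anyone hitting the same error: the failing line in the traceback is a plain dict lookup, `self.textnorm_int_dict[int(style)]`, applied to column 3 of the tokenized text. A minimal sketch of that pattern with a guarded lookup is below. The mapping `TEXTNORM_INT_DICT` and the helper `lookup_textnorm` are hypothetical stand-ins, not FunASR code; the real dictionary's keys and values are not shown in this issue.

```python
# Hypothetical stand-in for the model's textnorm_int_dict. The ids 1001 and 1002
# are made up for illustration; the real values are defined in FunASR's
# sense_voice/model.py and do not appear in this issue.
TEXTNORM_INT_DICT = {1001: 0, 1002: 1}


def lookup_textnorm(token_ids, mapping=TEXTNORM_INT_DICT):
    """Map each token id to its textnorm index, with a descriptive error.

    Mirrors the shape of the list comprehension in the traceback, but reports
    every unexpected id at once instead of raising a bare KeyError.
    """
    unknown = sorted({int(t) for t in token_ids if int(t) not in mapping})
    if unknown:
        raise ValueError(
            f"token ids {unknown} are not textnorm tags; "
            f"expected one of {sorted(mapping)}"
        )
    return [[mapping[int(t)]] for t in token_ids]


if __name__ == "__main__":
    print(lookup_textnorm([1001, 1002]))
    try:
        lookup_textnorm([1001, 94])
    except ValueError as err:
        print(err)
```

A check like this would only show which id was unexpected in that column. It does not explain why the column held 94 for these records.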