microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
MIT License
1.09k stars 113 forks source link

VATLM: Error when loading finetuned checkpoints for infer_s2s #54

Open naraysa opened 1 year ago

naraysa commented 1 year ago

Hi,

I am trying to load the finetuned models provided for VATLM. However, I encounter this error where it tries to access a local storage where it was trained. This occurs with all the models you have shared.

The error is:

`File "/home/projects/SpeechT5/VATLM/fairseq/fairseq/distributed/utils.py", line 328, in distributed_main main(cfg, **kwargs)

File "/home/projects/SpeechT5/VATLM/vat_hubert/vathubert/infer_s2s.py", line 93, in main return _main(cfg, h)

File "/home/projects/SpeechT5/VATLM/vat_hubert/vathubert/infer_s2s.py", line 115, in _main models, saved_cfg, task = checkpoint_utils.load_model_ensemble_and_task([cfg.common_eval.path])

File "/home/projects/SpeechT5/VATLM/fairseq/fairseq/checkpoint_utils.py", line 446, in load_model_ensemble_and_task model = task.build_model(cfg.model)

File "/home/projects/SpeechT5/VATLM/fairseq/fairseq/tasks/fairseq_task.py", line 324, in build_model model = models.build_model(cfg, self)

File "/home/projects/SpeechT5/VATLM/fairseq/fairseq/models/init.py", line 96, in build_model return model.build_model(cfg, task)

File "/home/projects/SpeechT5/VATLM/vat_hubert/vathubert/models/vathubert_asr.py", line 400, in build_model state = checkpoint_utils.load_checkpoint_to_cpu(

File "/home/projects/SpeechT5/VATLM/fairseq/fairseq/checkpoint_utils.py", line 303, in load_checkpoint_to_cpu with open(local_path, "rb") as f:

FileNotFoundError: [Errno 2] No such file or directory: '/mnt/default/v-qiushizhu/vatlm_related/results/fbank_large_vox_pretrain_iter5_ext_audio_1_32ngpu_2updatefreq/checkpoints/checkpoint_388_600000.pt' `

I noticed that the shared VATLM models don't have cfg.model.w2v_args in the statedict and are None during the loading.

Would be great if you could help in resolving this.