yeyupiaoling / MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
Apache License 2.0
563 stars 100 forks source link

运行train.py报错 #69

Closed happywch closed 5 months ago

happywch commented 8 months ago

Traceback (most recent call last): 'feature', 'params': {'max_t': 30, 'num_t_sub': 3}, 'prob': 1.0} File "c:\Users\ww\Desktop\MASR-release-2.3.x\train.py", line 26, in trainer.train(save_model_path=args.save_model_path, File "c:\Users\ww\Desktop\MASR-release-2.3.x\masr\trainer.py", line 482, in train self.setup_dataloader(augment_conf_path=augment_conf_path, is_train=True) File "c:\Users\ww\Desktop\MASR-release-2.3.x\masr\trainer.py", line 99, in setup_dataloader self.train_loader = DataLoader(dataset=self.train_dataset, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "D:\miniconda\envs\CBVR\Lib\site-packages\torch\utils\data\dataloader.py", line 245, in init raise ValueError('prefetch_factor option could only be specified in multiprocessing.' ValueError: prefetch_factor option could only be specified in multiprocessing.let num_workers > 0 to enable multiprocessing, otherwise set prefetch_factor to None.

报错如上,在create_data.py 时并没有使用python create_data.py --is_merge_audio=True来生成二进制,是这个原因引起的吗。目前已经在dataset下生成manifest.test、manifest.train、mean_istd .json、vocabulary.txt都已经生成并且都正常

yeyupiaoling commented 6 months ago

这个是最新版本pytorch的原因,代码已经修复了。