MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
BSD 2-Clause "Simplified" License
3.28k stars 272 forks source link

EOF Error when running diarize.py #68

Closed c-perumalla closed 4 months ago

c-perumalla commented 1 year ago

Windows 11 GPU, cuda12.2

Traceback (most recent call last): File "C:\Users\calvinap\miniconda3\lib\runpy.py", line 188, in _run_module_as_main mod_name, mod_spec, code = _get_module_details(mod_name, _Error) File "C:\Users\calvinap\miniconda3\lib\runpy.py", line 111, in _get_module_details import(pkg_name) File "C:\Users\calvinap\whisper-diarization\diarize.py", line 118, in msdd_model.diarize() File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context return func(*args, **kwargs) File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\msdd_models.py", line 1180, in diarize self.clustering_embedding.prepare_cluster_embs_infer() File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\msdd_models.py", line 699, in prepare_cluster_embs_infer self.emb_sess_test_dict, self.emb_seq_test, self.clus_test_labeldict, = self.run_clustering_diarizer( File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\msdd_models.py", line 866, in run_clustering_diarizer scores = self.clus_diar_model.diarize(batch_size=self.cfg_diar_infer.batch_size) File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\clustering_diarizer.py", line 437, in diarize self._perform_speech_activity_detection() File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\clustering_diarizer.py", line 325, in _perform_speech_activity_detection self._run_vad(manifest_vad_input) File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\nemo\collections\asr\models\clustering_diarizer.py", line 218, in _run_vad for i, test_batch in enumerate( File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\tqdm\std.py", line 1178, in iter for obj in iterable: File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\torch\utils\data\dataloader.py", line 441, in iter return self._get_iterator() File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\torch\utils\data\dataloader.py", line 388, in _get_iterator return _MultiProcessingDataLoaderIter(self) File "C:\Users\calvinap\whisper-diarization\env\lib\site-packages\torch\utils\data\dataloader.py", line 1042, in init w.start() File "C:\Users\calvinap\miniconda3\lib\multiprocessing\process.py", line 121, in start self._popen = self._Popen(self) File "C:\Users\calvinap\miniconda3\lib\multiprocessing\context.py", line 224, in _Popen return _default_context.get_context().Process._Popen(process_obj) File "C:\Users\calvinap\miniconda3\lib\multiprocessing\context.py", line 327, in _Popen return Popen(process_obj) File "C:\Users\calvinap\miniconda3\lib\multiprocessing\popen_spawn_win32.py", line 93, in init reduction.dump(process_obj, to_child) File "C:\Users\calvinap\miniconda3\lib\multiprocessing\reduction.py", line 60, in dump ForkingPickler(file, protocol).dump(obj) _pickle.PicklingError: Can't pickle <class 'nemo.collections.common.parts.preprocessing.collections.SpeechLabelEntity'>: attribute lookup SpeechLabelEntity on nemo.collections.common.parts.preprocessing.collections failed Traceback (most recent call last): File "", line 1, in File "C:\Users\calvinap\miniconda3\lib\multiprocessing\spawn.py", line 116, in spawn_main exitcode = _main(fd, parent_sentinel) File "C:\Users\calvinap\miniconda3\lib\multiprocessing\spawn.py", line 126, in _main self = reduction.pickle.load(from_parent) EOFError: Ran out of input

ScriptTiger commented 1 year ago

@c-perumalla, just to clarify, it was working for you before, but then you just recently starting getting this error?