Closed triumph9989 closed 2 years ago
Please refer this issue: https://github.com/NVIDIA/NeMo/issues/4157 I also faced this issue its a VAD model issue, if you use VAD_Telephony_marblenet.nemo. It will work without any error.
When I run from base (no virtual environment) on some machine it run well even with marblenet only.
You can run notebook tutorial it will work. https://github.com/NVIDIA/NeMo/blob/main/tutorials/speaker_tasks/Speaker_Diarization_Inference.ipynb
@alamnasim Yes. I have tried to replace VAD with Telephony_marblenet and it works for me. I really appreciate your help : )
@fayejf FYI
I used VAD_Telephony_marblenet but still it gets stuck at generating predictions. Any solutions?
For temporary fix, change num_workers to 1.
I have tried to run it on Ubuntu 22.04.3 LTS with 8 cpu cores. When i changedconfig.num_workers = 8
to config.num_workers = 1
the freeze has gone and it worked. I did not change vad model, still works with pretrained_vad = 'vad_multilingual_marblenet'.
Describe the bug Hi, I'm new to nemo system. I have no idea why my program stops at "generating predictions with overlapping input segments".
Steps/Code to reproduce bug
python offline_diarization.py
my manifest.json
{"audio_filepath": "/home/ec5017b/media-lab/nemo/NeMo/examples/speaker_tasks/diarization/data/voxconverse/voxconverse_test_wav/voxconverse_test_wav/aepyx.wav", "offset": 0, "duration": null, "label": "infer", "text": "-"}
offline_diarization.yaml
Expected behavior
I hope to finish the inference.
Environment overview
Environment details
Additional context I have not prepared Megatron GPT & numba Moreover, I only use the aepyx.wav in the Voxconverse test.