A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NeMo/examples/asr/vad_infer.py script fails with IndexError if option --dont_auto_split is not set and at least 1 .wav file is long enough so that nemo.collections.asr.parts.utils.vad_utils.prepare_manifest() function split the .wav file.
Describe the bug
NeMo/examples/asr/vad_infer.py
script fails withIndexError
if option--dont_auto_split
is not set and at least 1.wav
file is long enough so thatnemo.collections.asr.parts.utils.vad_utils.prepare_manifest()
function split the.wav
file.Steps/Code to reproduce bug
Expected behavior
No errors
Environment overview (please complete the following information)
Environment details
If NVIDIA docker image is used you don't need to specify these. Otherwise, please provide: