NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
Apache License 2.0

Unable to decode using canary 1b model #10680

Closed uni-saurabh-vyas closed 3 days ago

uni-saurabh-vyas commented 1 month ago

Describe the bug

I am running NeMo inside a Docker environment with a single GPU, and I am unable to get ASR output from the Canary 1B model.

Error executing job with overrides: ['pretrained_name=nvidia/canary-1b', 'audio_dir=/opt/workspace/es_test', 'output_filename=canary_output.txt', 'clean_groundtruth_text=True', 'langid=es', 'batch_size=1', 'compute_timestamps=False', 'compute_langs=False', 'cuda=0', 'amp=True', 'append_pred=False', 'pred_name_postfix=canary']
Traceback (most recent call last):
  File "/opt/asr_ensemble/repo/nemo/examples/asr/transcribe_speech.py", line 290, in main
    asr_model.change_decoding_strategy(cfg.ctc_decoding)
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/models/aed_multitask_models.py", line 203, in change_decoding_strategy
    self.decoding = MultiTaskDecoding(
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/parts/submodules/multitask_decoding.py", line 377, in __init__
    super().__init__(
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/parts/submodules/multitask_decoding.py", line 124, in __init__
    raise NotImplementedError("Greedy decoding is not implemented yet.")
NotImplementedError: Greedy decoding is not implemented yet.

Steps/Code to reproduce bug

python3 nemo/examples/asr/transcribe_speech.py \
pretrained_name="nvidia/canary-1b" \
audio_dir="/opt/workspace/es_test" \
output_filename="canary_output.txt" \
clean_groundtruth_text=True \
langid='es' \
batch_size=1 \
compute_timestamps=False \
compute_langs=False \
cuda=0 \
amp=True \
append_pred=False \
pred_name_postfix="canary"

pzelasko commented 1 month ago

Which NeMo version is this? Greedy decoding for Canary should work if you install the latest 2.0 pre-release, or the latest main branch.
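
To answer the version question from inside the container, a small check like the one below may help. It is only a sketch: it assumes NeMo was installed as the `nemo_toolkit` distribution (its PyPI name), and the helper names `installed_version` and `major_version` are made up for illustration. The "2.0 or newer" threshold simply mirrors the suggestion above and is not a verified cutoff.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


def major_version(version_string: str) -> int:
    """Extract the leading major number, e.g. '2.0.0rc1' -> 2."""
    return int(version_string.split(".")[0])


if __name__ == "__main__":
    # Assumption: NeMo is installed under its PyPI distribution name.
    found = installed_version("nemo_toolkit")
    if found is None:
        print("nemo_toolkit is not installed in this environment")
    elif major_version(found) >= 2:
        # Threshold taken from the comment above, not independently verified.
        print(f"NeMo {found}: 2.0 or newer, greedy decoding for Canary should be available")
    else:
        print(f"NeMo {found}: older than 2.0, consider upgrading to a 2.0 pre-release or main")
```

Note that the traceback mixes a repository checkout for `transcribe_speech.py` with a `dist-packages` install of the library, so the version reported here is the one that matters for the decoding code path.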

github-actions[bot] commented 1 week ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions[bot] commented 3 days ago

This issue was closed because it has been inactive for 7 days since being marked as stale.