Closed lawik closed 11 months ago
Looking at small
and small.en
they seem to lack task_to_id
:
https://huggingface.co/openai/whisper-small/blob/main/generation_config.json
vs
https://huggingface.co/openai/whisper-small.en/blob/main/generation_config.json
Something similar: https://github.com/huggingface/transformers/issues/25084
Looking at openai implementation, for the monolingual model they don't include the task token. So passing task: nil
to the serving is the way to go. I added an error that suggests that (1020c752ff8fb9a0738620d5452306da373cddd8).
Thanks!
The english-specific models are smaller, faster and more effective if you know you are dealing with english. Or so I gather.
Trying the regular Livebook Neural Network Smart cell, editing it and switching in
.en
on the model doesn't run:This will fail with weird configuration errors:
That will give:
Changing it to this runs but produces an empty transcript:
And the default works fine.
I have used
tiny.en
and friends beforegeneration_config
was added at all.