Closed maptz closed 1 year ago
Oh dear, that was an unfortunate bad first experience. Can you by chance the mp3 audio that's causing this failure?
Actually, it could be a problem that came up with the last version of whisper. I'm investigating such a problem with whisper version 20230306 What gives for you:
pip freeze | grep whisper
Hi @Jeronymous ,
I'm not able to upload the audio file I'm afraid (licensing issues), but I'll try to see if I can find another file that duplicates the issue when using the CUDA device.
FYI, I'm on Windows, so the command above doesn't work verbatim, but I think the information you're after is:
openai-whisper @ git+https://github.com/openai/whisper.git@b80bcf610d89960bc658b61af9c333fc6d978d78
whisper-timestamped==1.10.1
BTW, I though this was restricted to CPU processing, but trying it with another audio file, it seems that the issue occurs with some files in cpu mode too.
Is there a version of whisper that you think works correctly?
Indeed it is a matter of using version 2023036. You can update whisper-timestamped, it should be fixed now. And I recommend for now to use for now
openai-whisper==20230124
(because that version of whisper is super recent and might still include bugs. see https://github.com/openai/whisper/discussions/1058 and I see that whisper repo is moving a lot now)
Thanks.
Hi,
I was trying out this for the first time and run into the following error when using CUDA:
My file is
This works correctly, when the device is specified as "cpu"
Any hints as to how I can solve this?