I encountered an issue when using the code from the sample Colab file. The code is as follows:
model = whisper.load_model("small.en-tdrz")
result = whisper.transcribe(model, "demo_video.mp4", verbose=True, condition_on_previous_text=False)
# time segments are a bit more stable without prev_text conditioning as it wasn't yet fine-tuned that way
Running it raises the following error:
ValueError: tuple.index(x): x not in tuple
Also, I tried the model with MP3 and WAV files but got the same error!
I just tried the Colab notebook as well and ran into the same issue. I'll look into what's going on (it likely has something to do with this PR: https://github.com/akashmjn/tinydiarize/pull/11). Thanks for pointing it out.
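In case it helps with narrowing this down: the message itself comes from Python's built-in tuple.index, which raises ValueError when the value it is asked for is not in the tuple. Below is a minimal sketch, independent of whisper, that reproduces the message and captures the traceback. KNOWN_CODES, find_code and describe_failure are hypothetical names used only for illustration; nothing here is taken from the whisper or tinydiarize source.

```python
import traceback

# Hypothetical tuple of known values; NOT taken from the whisper source.
KNOWN_CODES = ("en", "de", "fr")


def find_code(code):
    """Return the position of `code` in KNOWN_CODES (raises ValueError if absent)."""
    return KNOWN_CODES.index(code)


def describe_failure(func, *args, **kwargs):
    """Call func and return (message, traceback text) on ValueError, else (None, None)."""
    try:
        func(*args, **kwargs)
    except ValueError as exc:
        return str(exc), traceback.format_exc()
    return None, None


message, trace = describe_failure(find_code, "xx")
print(message)  # the ValueError text
print(trace)    # shows which call performed the failing lookup
```

Wrapping the failing transcribe call the same way (or simply posting the full traceback from the notebook) should show which lookup inside the library fails, assuming the error is raised as a plain ValueError.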