Closed mrmachine closed 1 year ago
Same issue here, using ggml-base.bin
... [00:04:08.760 --> 00:04:09.920] I don't like him very much." [00:04:09.920 --> 00:04:10.920] I don't know. [00:04:10.920 --> 00:04:11.920] I don't know. [00:04:11.920 --> 00:04:12.920] I don't know. [00:04:12.920 --> 00:04:13.920] I don't know. [00:04:13.920 --> 00:04:14.920] I don't know. [00:04:14.920 --> 00:04:15.920] I don't know. [00:04:15.920 --> 00:04:16.920] Hey, talk out. [00:04:16.920 --> 00:04:17.920] I don't know. [00:04:17.920 --> 00:04:18.920] I don't know. [00:04:18.920 --> 00:04:25.400] This is kind of the street ... [00:06:26.960 --> 00:06:27.960] Oh, I loved it. [00:06:27.960 --> 00:06:29.360] It was very different. [00:06:29.360 --> 00:06:30.360] Very different. [00:06:30.360 --> 00:06:31.360] Very different. [00:06:31.360 --> 00:06:33.600] Yeah, especially his fourth student. [00:06:33.600 --> 00:06:35.160] I told you. ...
And I tried to transcribe a file, not from stream.
I attached audio file here.
Not sure fined
similar to #719 and #612
It's definitely a problem with inference. Slicing the audio up and re-transcribing from the failed part works just fine.
Should be resolved via f19e23fbd108ec3ac458c7a19b31c930719e7a94
I pulled the latest code ,still met this issue , I am using ggml-small.en.bin
Is this a general problem with Whisper? Does it occur more often with the stream method? Maybe because it happens more often at the end of a file and the stream method is transcribing a new file every 30 seconds?