Closed Anurag-RTS closed 4 months ago
Remove WithSpeedUp2x, there is no implementation in whisper.cpp anymore, it will always produce an empty result. Support will be added in the future, it was removed because it degraded the quality a lot
@Sing303 Yeah, it works... BUT, that really tanked the performance 😞 From 54s to fricking 47m20s!
Time Taken to init Whisper: 00:00:00.905
⟫ Starting Whisper processing...
00:00:00.000-->00:00:10.000: [MUSIC] [00:07:25.810]
00:00:10.000-->00:00:16.840: I was appointed six months ago. [00:00:00.000]
00:00:16.840-->00:00:19.760: And the more I've spoken about feminism, [00:00:00.000]
00:00:19.760-->00:00:24.600: the more I have realized that fighting for women's rights [00:00:00.000]
00:00:24.600-->00:00:29.960: has too often become synonymous with man-hating. [00:00:00.000]
00:00:29.960-->00:00:34.960: If there is one thing I know for certain, [00:08:04.272]
00:00:34.960-->00:00:38.960: it is that this has to stop. [00:00:00.000]
00:00:38.960-->00:00:45.960: For the record, feminism by definition is the belief [00:00:00.000]
00:00:45.960-->00:00:51.960: that men and women should have equal rights and opportunities. [00:00:00.000]
00:00:51.960-->00:00:56.960: It is the theory of the political, economic, [00:00:00.000]
00:00:56.960-->00:01:01.960: and social equality of the sexes. [00:07:39.712]
00:01:01.960-->00:01:05.960: I started questioning gender-based assumptions a long time ago. [00:00:00.000]
00:01:05.960-->00:01:11.960: When I was eight, I was confused being called bossy, [00:00:00.000]
00:01:11.960-->00:01:16.960: because I wanted to direct the plays that we would put on for our parents. [00:00:00.000]
00:01:16.960-->00:01:19.960: But the boys were not. [00:00:00.000]
00:01:19.960-->00:01:25.960: When at 14, I started to be sexualized by certain elements of the media. [00:00:00.000]
00:01:25.960-->00:01:31.960: When at 15, my girlfriends started dropping out of their beloved sports teams, [00:04:40.371]
00:01:31.960-->00:01:34.960: because they didn't want to appear muscly. [00:00:00.000]
00:01:34.960-->00:01:42.960: When at 18, my male friends were unable to express their feelings. [00:00:00.000]
00:01:42.960-->00:01:49.960: I decided that I was a feminist, and this seemed uncomplicated to me. [00:00:00.000]
00:01:49.960-->00:01:58.960: But my recent research has shown me that feminism has become an unpopular word. [00:04:01.953]
00:01:58.960-->00:02:06.960: Women are choosing not to identify as feminist. [00:00:00.000]
00:02:06.960-->00:02:18.960: Apparently, I am among the ranks of women whose expressions are seen as too strong, too aggressive. [00:00:00.000]
00:02:18.960-->00:02:23.960: Isolating and anti-men. [00:04:03.947]
00:02:23.960-->00:02:27.960: Unattractive, even. [00:00:00.000]
00:02:27.960-->00:02:35.960: Why has the word become such an uncomfortable one? [00:00:00.000]
00:02:35.960-->00:02:44.960: I am from Britain, and I think it is right that I am paid the same as my male counterparts. [00:00:00.000]
00:02:44.960-->00:02:50.960: I think it is right that I should be able to make decisions about my own body. [00:04:40.445]
00:02:50.960-->00:03:00.960: I think it is right that women be involved on my behalf in the policies and the decisions that will affect my life. [00:00:00.000]
00:03:00.960-->00:03:09.960: I think it is right that socially I am afforded the same respect as men. [00:00:00.000]
00:03:09.960-->00:03:22.960: But sadly, I can say that there is no one country in the world where all women can expect to receive these rights. [00:04:20.621]
00:03:22.960-->00:03:30.960: No country in the world can yet say that they have achieved gender equality. [00:00:00.000]
00:03:30.960-->00:03:34.960: Thank you very, very much. [00:00:00.000]
00:03:34.960-->00:03:44.960: [music] [00:01:07.558]
00:03:44.960-->00:03:54.960: [no audio] [00:01:07.983]
⟫ Completed Whisper processing...
54s was on version 1.4.7? Or on version 1.5.0 with WithSpeedUp2x option? If WithSpeedUp2x, then 54s is the model loading time, not the transcribing time, because with WithSpeedUp2x transcribing is not performed at all
54s on v1.4.7
with WithSpeedUp2x
enabled, otherwise took ~2m13s iirc.
because with WithSpeedUp2x transcribing is not performed at all
This has been (still is) in production code, with WithSpeedUp2x
enabled, with daily ~25 videos/audios getting transcribed then processed. But now, atleast on my machine, I can't reproduce that old behavior even if I go back to v1.4.7
. 🤔
Ok, I was misremembering my old version. After downgrading to v1.4.6
, I got back the sub 1-min transcription times. I'll wait until ggerganov re-enables WithSpeedUp2x
option in base library.
After upgrading to v1.5.0, I can no longer elicit any output using Whisper.net whereas it used to work flawlessly. Nothing has been changed in the POC env other than Whisper.net library version. Also, I noticed that I don't see Whisper debug logs anymore, but that's probably unrelated to this issue.
Program.cs (slightly modified from
examples/Simple/Program.cs
)