davabase / transcriber_app

Real time speech to text transcription app.
363 stars 69 forks source link

Faster-Whisper Support? #24

Open JvilleDev opened 1 year ago

JvilleDev commented 1 year ago

Can u add Faster-whisper support? Or How can i do that?

NosgothicRealm commented 1 year ago

Hello! I tried to implement this app using WhisperX because of the diarization option and the faster_whisper's models.

But I have problems parsing the audio to Whisperx This app 'normalizes' the audio and that works for openai's whisper, but not for Whisperx (I'm no expert in audio format)

If at some point I can implement this app using faster_whisper/whisperx I'll let you know.

sorry for my English!

lukkzien commented 10 months ago

Hello! I tried to implement this app using WhisperX because of the diarization option and the faster_whisper's models.

But I have problems parsing the audio to Whisperx This app 'normalizes' the audio and that works for openai's whisper, but not for Whisperx (I'm no expert in audio format)

If at some point I can implement this app using faster_whisper/whisperx I'll let you know.

sorry for my English!

Hello, were you able to accomplish it?

NosgothicRealm commented 10 months ago

But I have problems parsing the audio to Whisperx This app 'normalizes' the audio and that works for openai's whisper, but not for Whisperx (I'm no expert in audio format)

If at some point I can implement this app using faster_whisper/whisperx I'll let you know.

Until some point, yes, but I'm still having issues because there are still problems when audio is shared using 'normalization' I'm trying to implement something similar, but using WhisperX because that repo uses the diarization too.

If I have something good for real-time I'll share it :)

HardikJain02 commented 6 months ago

Have you got anything for real-time?