-
Hello!
I've been using the WhisperX large-v2 model in English on a project to transcribe vocals taken from songs, which I derive using source separation with spleeter. If it matters, I've been runn…
-
I started following the WhisperX project six months ago but haven't looked at it for a long time. Either something has changed in the project or I am doing something wrong. I did a lot of research an…
-
Hello, I am experiencing some issues while generating subtitles for the video attached below. Despite trying various values in the Advanced Parameters and Voice Detection sections, I am not able to ac…
-
There's an extension to Whisper called WhisperX. Why not use it? It has built-in diarization as well as large-file chunking and enhanced audio segment alignment.
https://github.com/m-bain/whisperX
…
-
I noticed that the "end" timestamp for the last phoneme is often off for the Dutch language, leading to the duration of the last phoneme being way longer than the actual utterance.
As you can see i…
-
https://hearhear.vercel.app/ is an implementation of the desired effect.
Shappie can use WhisperX (https://github.com/m-bain/whisperX) to record transcriptions of meetings in real time. After the me…
-
(just more of an FYI)
Was hoping to use the latest version but bump up some of the requirement versions (e.g. faster-whisper), but looks like the [3.1.5 release](https://pypi.org/project/whisperx/3…
-
Here's a simple script to identify the language of an audio file:
import whisperx
import time
start_time = time.time()
filepath = ""
whisper_model = whisperx.load_model("medium", device = "cuda", …
-
Find out what the expected error rate is, find out what's possible.
https://github.com/kadirnar/whisper-plus
Papers
https://arxiv.org/pdf/2212.04356
WER:
https://pubs.aip.org/asa/jel/articl…
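For anyone comparing their own transcripts against the figures in those links, here is a rough sketch of how WER (word error rate) is usually computed: word-level edit distance divided by the number of reference words. The function name `word_error_rate` and the lowercase-and-split-on-whitespace normalization are my own assumptions for illustration. Published numbers typically use a more careful text normalizer, so results from this will not match them exactly.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count.

    Hypothetical helper for illustration; normalization here is only
    lowercasing and whitespace splitting (an assumption, not a standard).
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (minus the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Note that WER can exceed 1.0 when the hypothesis contains many extra words, which does happen with hallucinated text on sung vocals or silence.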
-
In the warnings, if you actually run this command to upgrade the loaded checkpoint:
> Lightning automatically upgraded your loaded checkpoint from v1.5.4 to v2.0.6. To apply the upgrade to your fil…