nkilm / offline-whisperx

Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.
17 stars 1 forks source link

Confidence scores #3

Open brettdavies opened 2 months ago

brettdavies commented 2 months ago

Hi @nkilm! I found your repo, which does most of what I want. Did you happen to implement the confidence score functionality from the whisperX library? If not, I'm happy to do the work if you can point me in the right direction.

🎯 Accurate word-level timestamps using wav2vec2 alignment

Kind regards.

nkilm commented 2 months ago

Hi @brettdavies👋

By default, confidence scores are also being added to the output after Alignment. Eg: Checkout the sample output for an audio sample.json, it has score for each of the segments. Please feel free to correct me if I'm wrong here:)

Regards, Nikhil

https://github.com/nkilm/offline-whisperx/blob/a07bbe5ac15b5e114fc448b2dcf864898e50a08a/output/sample.json#L3-L17