issues
search
m-bain
/
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
BSD 2-Clause "Simplified" License
12.61k
stars
1.34k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Are these warning about torch version or pyannote version important?
#933
ywangwxd
opened
4 hours ago
0
Default config of `without_timestamps=True` affects whisper transcript quality.
#932
Artaches
opened
12 hours ago
0
WhisperX missing pieces of transcript compared to Whisper API
#931
tomhayw
opened
18 hours ago
1
Update MANIFEST.in to include necessary files
#930
frostming
opened
22 hours ago
0
Solution for Timestamps Not Appearing When Using Other Languages Like English in Korean Language Models
#929
THePhanT00M
opened
1 day ago
0
crisper whisper just pluggable?
#928
kunibald413
opened
1 day ago
0
A module that was compiled using NumPy 1.x cannot be run in NumPy 2.0.1 as it may crash
#927
2424004764
opened
4 days ago
11
Chunking with stride
#926
pramadikaegamo
opened
5 days ago
0
FileNotFoundError
#925
diaverso
opened
5 days ago
0
whisperx.DiarizationPipeline load long time
#924
smallpize
opened
6 days ago
0
Support Arabic Language
#923
abdelkrimkr
opened
1 week ago
0
Feat: add new align models - SHORT
#922
Equipo45
opened
1 week ago
0
WhisperX can Generate the N-best (top few) hypotheses?
#921
hpjang
opened
1 week ago
0
Fail to generate segment
#920
leinace1001
closed
1 week ago
1
Any ways to reduce or calibrate the offset of word timeline?
#919
leinace1001
opened
2 weeks ago
0
TranscriptionOptions.__new__() missing 1 required positional argument: 'hotwords'
#918
Tejes
opened
2 weeks ago
2
supress_numerals is eliminating numbers from transcription, not considering them words.
#917
juangea
opened
2 weeks ago
0
Regarding the issue of sentence length
#916
heartInsert
opened
3 weeks ago
2
API server hangs after a certain period
#915
dineshveguru
closed
2 weeks ago
1
I need advice on the Wav2Vec2 English model.
#914
sulutian
opened
3 weeks ago
0
feat: Enable optional dynamic prompting for the FasterWhisperPipeline
#913
jameshu88
opened
3 weeks ago
0
Allows n_samples to be passed in detect_language
#912
marcelovjunior
opened
3 weeks ago
0
Why do the numbers in the ASR results not have a start and end timestamp?
#911
hpjang
opened
3 weeks ago
4
Word-level timestamps not working with python implementation
#910
rkulyassa
opened
3 weeks ago
0
Dockerfile for transcription and Speaker Diarization
#909
kowshik24
opened
3 weeks ago
3
Finetuned large-v3 inference problem.
#908
sinisha
opened
4 weeks ago
1
Phoneme-Based ASR For Arabic
#907
MustaphaLargou25
opened
4 weeks ago
0
Bad things Error!
#906
Chiyan200
opened
1 month ago
1
whisperX not working with Google Collab?
#905
m01ali
opened
1 month ago
2
Compatible with latest faster-whisper
#904
latent-variable
opened
1 month ago
0
libcudnn_cnn.so.9.1.0 issue
#903
kowshik24
opened
1 month ago
2
Unable to load any of {libcudnn_cnn.so.9.1.0, libcudnn_cnn.so.9.1, libcudnn_cnn.so.9, libcudnn_cnn.so}
#902
Leandrocnf
closed
1 month ago
7
WhisperX in Google colab Unable to load any of {libcudnn_ops.so.9.1.0, libcudnn_ops.so.9.1, libcudnn_ops.so.9, libcudnn_ops.so}
#901
sijitang
closed
1 month ago
11
Multiple improvements: language detection per segment, VAD min duration on/off, unique speakers, pyproject.toml and more.
#900
cvl01
opened
1 month ago
2
Could not locate `cudnn_ops_infer64_8.dll`. Please make sure it is in your library path!
#899
YoungPhlo
opened
1 month ago
3
Whisper large V3 turbo support?
#898
utility-aagrawal
opened
1 month ago
4
Model is Downloaded but not loaded jonatasgrosman--wav2vec2-large-xlsr-53-japanese
#897
andriken
opened
1 month ago
0
whisperX removing silence / pauses
#896
tsmdt
opened
1 month ago
0
Turbo-V3
#894
brainer3220
opened
1 month ago
13
Why can't we do multilanguage forced aligment without loading a language-specific alignment model?
#893
empz
opened
1 month ago
2
Return top-k detected languages with probabilities
#892
danielmunioz
opened
1 month ago
0
whisper based simple cross-lingual speech recognition demo
#891
pika-online
opened
1 month ago
0
cpu utilisation maxes at 50% (conda?)
#890
chboishabba
opened
1 month ago
0
[Feature] Silero VAD support
#889
3manifold
opened
2 months ago
0
Silero VAD support
#888
3manifold
opened
2 months ago
8
RuntimeError: No position encodings are defined for positions >= 448, but got position 448
#887
RichardQin1
opened
2 months ago
0
How to load model?
#886
salekeennayeem
closed
1 month ago
1
main branch code is not consistent with 3.1.1 release
#885
sabn0
opened
2 months ago
0
compute_type whisperX transcription - option to use float32?
#884
valericac
closed
2 months ago
1
Just use this script to make the srt more readable for the end results. almost perfect, try it and share your thoughts.
#883
search620
opened
2 months ago
2
Next