issues
search
m-bain
/
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
BSD 2-Clause "Simplified" License
12.61k
stars
1.33k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Updated asr.py to allow replacing initial_prompt on transcribe method.
#882
clararoft
opened
2 months ago
1
Question on the pseudo code of arxiv paper
#881
treya-lin
opened
2 months ago
0
Using large-v3 returns some segments in all uppercase
#880
caryknoop
opened
2 months ago
1
I’ve successfully installed WhisperX, is there anything I can uninstall to save some disk space?
#879
MituButChi
opened
2 months ago
2
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
#878
kc01-8
opened
2 months ago
1
bump `pyannote.audio` version to 3.3.1
#877
JSchmie
opened
2 months ago
0
Feature/segement printing
#876
Hasan-Naseer
opened
2 months ago
6
latest faster-whisper support added
#875
Hasan-Naseer
opened
2 months ago
3
Split words separated by hyphens
#874
atferrys
opened
2 months ago
0
Use whisperx diarization offline
#873
itaipee
opened
2 months ago
2
Can Hard Coded Hyperparameters be moved to a config file?
#872
morsczx
closed
2 months ago
4
Parameter to enable verbose/Segment level printing for better debugging
#871
Hasan-Naseer
opened
2 months ago
1
Open PR to add latest version of faster-whisper
#870
Hasan-Naseer
opened
2 months ago
1
Wav2vec doesn't align numerical characters
#869
pr-data-port
opened
2 months ago
1
KeyError 'en'
#868
pr-data-port
opened
3 months ago
1
Added local_files_only option on whisperx.load_model for offline mode
#867
RoqueGio
opened
3 months ago
0
Transcribing error
#866
Siddiq199
opened
3 months ago
2
Could not load library libcudnn_ops_infer.so.8. Error: libcudnn_ops_infer.so.8: cannot open shared object file: No such file or directory
#865
IgorEzerskiy
closed
3 months ago
4
Fix/faster whisper issue
#864
matiasfnunezdev
closed
3 months ago
0
Implement PR improvements
#863
matiasfnunezdev
closed
3 months ago
0
transcribe: pass `suppress_numerals` into function to control per request
#862
eschmidbauer
opened
3 months ago
0
Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX?
#861
imc-db
opened
3 months ago
2
WhisperX just stops at Diarization
#860
Eli-117
closed
3 months ago
0
Version 3.1.5 is distributed on pypi but Github repo only has 3.1.1?
#859
dannguyen
opened
3 months ago
3
How to enable diarization in python code (not terminal)?
#858
imc-db
closed
3 months ago
2
Turning off timestamps?
#857
IndolentKheper
opened
3 months ago
1
not being able to pickup words at the last of the audio's while force aligning hindi audios
#856
xorsuyash
opened
3 months ago
0
provide a option to use local VAD model
#855
NewUserHa
opened
3 months ago
0
OSError: undefined symbol: _ZN2at4_ops10zeros...
#854
jessienab
opened
3 months ago
7
how to use whsperx with hugging face pipeline
#853
Oheed911
opened
3 months ago
0
Update alignment.py - added alignment for sk and sl languages
#852
jan-panoch
closed
3 months ago
2
whisperX witout internet access
#851
KarelVesely84
opened
3 months ago
2
Unable to run the whisperx with the installation steps provided in repository
#850
GlitCher50
closed
3 months ago
3
WhisperX return translated output instead of normal transcription
#849
BankNatchapol
opened
3 months ago
0
Gradio Demo
#848
lifeiteng
closed
3 months ago
0
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.
#847
S90pngprgm
opened
3 months ago
4
how to change max_new_tokens (parameter) of whisper
#846
freshpearYoon
opened
4 months ago
0
Speaker Diarization for bilingual speech
#845
01Ashish
opened
4 months ago
0
OAI Whisper transcribes correctly but whisperx returns `No active speech found in audio`
#844
reasv
opened
4 months ago
9
Bulk Processing
#843
Narbs1
opened
4 months ago
1
How to run Whisper X in Colab?
#842
ioudove
opened
4 months ago
4
Use whisperx and pyannote in Colab without HuggingFace token
#841
biagioscalingipsy
opened
4 months ago
1
How to use a fine-tuned segmentation model for diarization?
#840
Arche151
opened
4 months ago
6
How to achieve known text content and obtain the timestamp of the text corresponding to the audio
#839
RichardQin1
opened
4 months ago
2
Could I Add Timestamps to My Text by WhisperX?
#838
vick-wuwei
closed
4 months ago
2
Allow Repetition
#837
BrothaM
closed
4 months ago
1
AttributeError: partially initialized module 'whisperx' has no attribute 'load_model' (most likely due to a circular import)
#836
bsinghrana
closed
4 months ago
2
fix vad model load bug.
#835
duj12
opened
4 months ago
0
some not-default language alignments are downloaded, some throw error
#834
Marcophono2
opened
4 months ago
0
how can i close Vad?
#833
grx666
opened
4 months ago
0
Previous
Next