m-bain whisperX issues - Githubissues

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

BSD 2-Clause "Simplified" License

12.61k stars 1.33k forks

issues

Updated asr.py to allow replacing initial_prompt on transcribe method.

#882 clararoft opened 2 months ago
1
Question on the pseudo code of arxiv paper

#881 treya-lin opened 2 months ago
0
Using large-v3 returns some segments in all uppercase

#880 caryknoop opened 2 months ago
1
I’ve successfully installed WhisperX, is there anything I can uninstall to save some disk space?

#879 MituButChi opened 2 months ago
2
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.

#878 kc01-8 opened 2 months ago
1
bump `pyannote.audio` version to 3.3.1

#877 JSchmie opened 2 months ago
0
Feature/segement printing

#876 Hasan-Naseer opened 2 months ago
6
latest faster-whisper support added

#875 Hasan-Naseer opened 2 months ago
3
Split words separated by hyphens

#874 atferrys opened 2 months ago
0
Use whisperx diarization offline

#873 itaipee opened 2 months ago
2
Can Hard Coded Hyperparameters be moved to a config file?

#872 morsczx closed 2 months ago
4
Parameter to enable verbose/Segment level printing for better debugging

#871 Hasan-Naseer opened 2 months ago
1
Open PR to add latest version of faster-whisper

#870 Hasan-Naseer opened 2 months ago
1
Wav2vec doesn't align numerical characters

#869 pr-data-port opened 2 months ago
1
KeyError 'en'

#868 pr-data-port opened 3 months ago
1
Added local_files_only option on whisperx.load_model for offline mode

#867 RoqueGio opened 3 months ago
0
Transcribing error

#866 Siddiq199 opened 3 months ago
2
Could not load library libcudnn_ops_infer.so.8. Error: libcudnn_ops_infer.so.8: cannot open shared object file: No such file or directory

#865 IgorEzerskiy closed 3 months ago
4
Fix/faster whisper issue

#864 matiasfnunezdev closed 3 months ago
0
Implement PR improvements

#863 matiasfnunezdev closed 3 months ago
0
transcribe: pass `suppress_numerals` into function to control per request

#862 eschmidbauer opened 3 months ago
0
Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX?

#861 imc-db opened 3 months ago
2
WhisperX just stops at Diarization

#860 Eli-117 closed 3 months ago
0
Version 3.1.5 is distributed on pypi but Github repo only has 3.1.1?

#859 dannguyen opened 3 months ago
3
How to enable diarization in python code (not terminal)?

#858 imc-db closed 3 months ago
2
Turning off timestamps?

#857 IndolentKheper opened 3 months ago
1
Unable to pick up words at the end of the audio when force-aligning Hindi audio

#856 xorsuyash opened 3 months ago
0
Provide an option to use a local VAD model

#855 NewUserHa opened 3 months ago
0
OSError: undefined symbol: _ZN2at4_ops10zeros...

#854 jessienab opened 3 months ago
7
How to use whisperx with the Hugging Face pipeline

#853 Oheed911 opened 3 months ago
0
Update alignment.py - added alignment for sk and sl languages

#852 jan-panoch closed 3 months ago
2
whisperX without internet access

#851 KarelVesely84 opened 3 months ago
2
Unable to run the whisperx with the installation steps provided in repository

#850 GlitCher50 closed 3 months ago
3
WhisperX return translated output instead of normal transcription

#849 BankNatchapol opened 3 months ago
0
Gradio Demo

#848 lifeiteng closed 3 months ago
0
ValueError: Requested float16 compute type, but the target device or backend do not support efficient float16 computation.

#847 S90pngprgm opened 3 months ago
4
how to change max_new_tokens (parameter) of whisper

#846 freshpearYoon opened 4 months ago
0
Speaker Diarization for bilingual speech

#845 01Ashish opened 4 months ago
0
OAI Whisper transcribes correctly but whisperx returns `No active speech found in audio`

#844 reasv opened 4 months ago
9
Bulk Processing

#843 Narbs1 opened 4 months ago
1
How to run Whisper X in Colab?

#842 ioudove opened 4 months ago
4
Use whisperx and pyannote in Colab without HuggingFace token

#841 biagioscalingipsy opened 4 months ago
1
How to use a fine-tuned segmentation model for diarization?

#840 Arche151 opened 4 months ago
6
How to achieve known text content and obtain the timestamp of the text corresponding to the audio

#839 RichardQin1 opened 4 months ago
2
Could I Add Timestamps to My Text by WhisperX?

#838 vick-wuwei closed 4 months ago
2
Allow Repetition

#837 BrothaM closed 4 months ago
1
AttributeError: partially initialized module 'whisperx' has no attribute 'load_model' (most likely due to a circular import)

#836 bsinghrana closed 4 months ago
2
fix vad model load bug.

#835 duj12 opened 4 months ago
0
some not-default language alignments are downloaded, some throw error

#834 Marcophono2 opened 4 months ago
0
how can i close Vad?

#833 grx666 opened 4 months ago
0
