-
Aktualni verze whisperx hlasi behem zpracovani audia velke mnozstvi warning.
Pokud by je slo alespon castecne vyresit, bylo by to super
```
INFO:app.routers.stt:Received URL for processing: h…
-
执行了pip install -r requirements.txt
将所有模型都本地化了
![image](https://github.com/user-attachments/assets/0eb4ce75-7f2f-4589-a718-31dffc7c1270)
执行python testing.py 报错
(whisperx-offline) jzxt@jzxt-SY-KL-H5…
-
- Incorporate ASR confidence estimation scores with ConfidenceConfig in NVIDIA NeMO : https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/ASR_Confidence_Estimation.ipynb
- Investigate incorporating…
-
I am using whisperx for inference (which is built upon faster-whisper).
I have finetuned large-v3 model on 1k hours of domain-specific data. When I run standard inference the results are ok. Finetu…
-
Is there a repo or code that allows for real-time streaming with whisperx? Thank you!
-
I've been using WhisperX but I keep coming across issues whereby parts of the transcript are just missing entirely (i.e. half of sentences). I have ran the same audio file through OpenAI's Whisper API…
-
使用
```
ct2-transformers-converter --model BELLE-2/Belle-whisper-large-v3-zh --output_dir belle-whisper-large-v3-zh-ct2 \
--copy_files tokenizer.json --quantization float16
```
转换模型后,使用`whispe…
-
I think it would be great to be able to leverage WhisperX and speaker diarization. Any plans to do this?
https://github.com/m-bain/whisperX
-
![image](https://github.com/user-attachments/assets/d9b7e4dd-bc3c-48b1-be6f-8b295864fa30)
![image](https://github.com/user-attachments/assets/45b9ce30-cad8-4c04-bdfd-6b231bd2ac24)
-
Traceback (most recent call last):
File "C:\PROGRA~2\FASTER~1\fasterwhispergui.py", line 67, in
File "", line 1027, in _find_and_load
File "", line 1006, in _find_and_load_unlocked
File "…