-
-
Any plan to detect disfluencies, hesitation, fillers? That would be awesome
-
Running the latest version from the docker image.
```
Version: 1.1.1
===================================
[-] Model name: m-bain/whisperX
[-] Model configs: {'model_type': 'large-v2', 'segment…
-
So I've been working on several different LRC solutions.
The default whisper gets the lyrics to this song:
![image](https://github.com/zh-plus/Open-Lyrics/assets/789591/44524f1b-d95b-4e79-8edd-c…
-
https://github.com/guillaumekln/faster-whisper/issues/455#issuecomment-1705249830
``[/usr/local/lib/python3.10/dist-packages/whisperx/asr.py](https://zodbzjljtq-496ff2e9c6d22116-0-colab.googleuserc…
-
Pls help.
```
C:\Users\Administrator\Documents\GitHub\whisper-diarization>pip install -r requirements.txt
Collecting git+https://github.com/m-bain/whisperX.git@4cb167a225c0ebaea127fd6049abfaa3af9f8…
-
Hi,
First of all : tremendous work here ! Time performances are definitely amazing for both transcription and diarization, even on CPU with int8. It outperforms whisperx on my side, with quite the …
-
```
Traceback (most recent call last):
File "/root/anaconda3/envs/whisperx/bin/whisperx", line 8, in
sys.exit(cli())
File "/root/anaconda3/envs/whisperx/lib/python3.10/site-packages/whisp…
-
import os
import subprocess
import yaml
import pysrt
import torch
import re
import unicodedata
from pydub import AudioSegment
def sanitize_filename(filename):
# Remove diacritics and …
-
I transcribed a 22 minute mp3 file with - ASR_ENGINE=openai_whisper via the webui, timed it and it took 2:15 min. I then changed the env to - ASR_ENGINE=faster_whisper, recreated the container and it …