-
**Bug Description**
Running RVC single infer (convert_audio) on an Intel based Macbook pro with `PYTORCH_ENABLE_MPS_FALLBACK=1` does not seem to be working. Get a segmentation fault after a minute or…
-
For people working with potentially sensitive audio/data, how can we handle segmentation and diarization locally vs relying on hf API calls? is this an option and I am overlooking? Great tool by the…
-
Hi...
As recommended on GitHub, the best size of chunks is 10 to 30 seconds. However, the Librispeech dataset was split into various sizes starts from 2 secs.
My question is what is the optimal chun…
-
- Speech SDK log taken from a run that exhibits the reported issue.
Check here [https://gist.github.com/Elshaffei/cb1f13f1d79ccd6df0641b864420bc93](url)
- A stripped down, simplified version of y…
-
I am running tests with about 20 different audio files with different languages. I try the same audio file with both "diarize_whisper.rs" and "pyannote.rs". First of all I can say that segmentation an…
-
I have been able to reproduce this error regardless of the input file or the cli options used.
Error below
```
(whisper-diarization-main) PS F:\whisper-diarization-main> python diarize_parall…
-
I am running a Stream Deck + with the Audio Control plugin. I am having an issue when I add the **Dial controller** to one of th e dials, it seems to randomly crash.
**Steps to reproduce the behavi…
-
### Tested versions
3.1
### System information
Ubuntu 16
### Issue description
Sometimes there will be over-segmentation, two people's audio is divided into five people, the longer the audio, the…
-
I have a WhisperX Python script for transcribing meetings, but the speaker diarization for German is really bad, unfortunately.
After some research I came across the fine-tuned German segmentation…
-
when generatin spectrograms, if several files are shorter than chosen duration, the code stops, it should be possible. We should pad the short files with zeros so the duration is the same than desired…