-
I set up a local HLS stream playing a long video of someone talking.
Everything seems great until after exactly 2 minutes in the transcription stops completely.
INFO:faster_whisper:Processing…
-
I downloaded the 1min of silence mp3 but.. I can hear something in it.
I can also see some variation when opening the file in audacity - is this because of mp3 compression?
-
Failed to execute the make command on the raspberryPi 4 Model B
```
make -C ../libs/miniwebrtc
gcc -Wall -I../.. -O2 -O3 -funroll-loops -fomit-frame-pointer -fsigned-char -fno-exceptions -fPIC -…
-
I have fine-tuned an hugging face model. This model is not outputting end time stamp token for the segment.
When I run faster whisper with this model and use "word_timestamps=False", it is giving the…
-
Currently if real time transcription continues for a long time then sentence appear to be a huge so it's processing takes a lot of time even with using Cuda. Is that possible to set something like max…
-
Hello,
I am currently working on synchronizing lyrics in SRT files. However, I'm encountering an issue where the sentences are too long, and I would like to split them into individual lines for syn…
-
Build fails on rpi4 with Raspbian 64bit bookworm and python3.12.1, with ```/usr/lib/gcc/arm-linux-gnueabihf/12/include/arm_neon.h:6940:1: error: inlining failed in call to ‘always_inline’ ‘float32x4_t…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
# Pre-Processing
- [x] VAD
- [ ] NoiseRemoval
- [ ] AudioRouter to different modules depending on the audio quality (like [RouteLLM](https://github.com/lm-sys/RouteLLM))
- [ ] #10
# Main-Proces…
-
This module seems to generate a lot of aliasing. Is it intended to be use only with low frequency CV signals?
![image](https://user-images.githubusercontent.com/16314838/95147882-dc43ee80-0736-11eb-8…