-
Hi @streamer45, thanks for your awesome package! I found discrepancies between silero-vad-go and the Python package. My input file is a 13-minute-long speech of JFK, and silero-vad-go misses multiple …
wjkoh updated
1 month ago
-
Please help me. I want to use your feature in my app.
Android and the web are working perfectly. But I built it on iOS and I had an error below.
[IOSInAppWebViewController] (iOS) WebView ID 112896…
-
Intel MKL allows users with Intel CPU to run transcription and other AI models faster
/bounty 100
context: https://github.com/mediar-ai/screenpipe/issues?q=mkl
MKL was hard to setup with wi…
-
## ❓ Questions and Help
Hi silero team!
When i try to use silero-vad using python, I felt it is good.
But if i use silero-vad using c++, i got quite different result between python and c++.
I …
-
![image](https://github.com/user-attachments/assets/a6fb4262-1424-4b82-9a4a-6d79f00243e2)
Aborted(CompileError: WebAssembly.instantiate(): expected magic word 00 61 73 6d, found 3c 21 64 6f @+0)
w…
-
Transcription appears to be accurate, however the ending timestamps for each line are always set at the beginning timestamp of the next line, resulting in subtitles constantly displayed long after spe…
-
### Feature request
This feature request aims to improve the speed of Whisper's batched version by adding a VAD model (such as pyannote or from NeMO or Silero) and merging chunks up to 30 sec, instea…
-
I noticed while transcribing some of my own audio that near-silence doesn't get removed during VAD. In fact, running `noisereduce` actually made the problem dramatically worse, making 10 seconds of fa…
-
While trying to implement a voice activity detection feature using your vad-web library, I encountered a troublesome issue on iPhone devices. The application works as expected on other platforms, but …
-
## Goal
- Remove the need to press the button, detect the voice
- Medium-term
- Enables ambient voice detection
- Enables interruptibility
- Small model that has binary classifier for vo…