-
Hey guys, right now Im splitting my audio into channels using ffmpeg and numpy, after that I send to `BatchedInferencePipeline.Transcribe` for transcription.
But I was looking at `transcribe.py` cl…
-
We are currently using the Whisper-Tiny multilingual model and seeking ways to improve its performance. We would appreciate any insights or suggestions on how to enhance the model's accuracy, speed, a…
-
### Description
What Does RMSEnergyExtractor Do?
Calculates RMS Energy:
RMS energy is a measure of the power of an audio signal. It is computed as the square root of the average of the squared …
-
## Web Speech API
[Web Speech API](https://techblog.asahi-net.co.jp/entry/2018/06/22/173617#Web-Speech-API)
Web Speech APIでTextまでやってる
## Voice Activity Detection
[Voice Activity Detectio…
-
### Description:
Automated approaches to abuse detection rely on annotated datasets. At least at present, unsupervised machine learning alone cannot detect abuse across languages. To fill the gap of …
-
Hi,
I am currently trying to implement the speech-recorder Voice Activity Detection in my electron App on my M1 Mac and I am facing the current issue :
`Error: dlopen(/myElectronPath/node_modules…
-
This will be done in the following steps
new setup looks the following:
- domains (like rhasspy 3 https://github.com/rhasspy/rhasspy3/blob/master/docs/wyoming.md)
- mic input
- wake …
-
I want to end speech recognition and call FinalResult() when silence last longer than a timeout parameter.
The pocketsphinx-python lib has a [get_in_speech()](https://github.com/bambocher/pocketsphin…
-
Currently we're simply calculating the loudness of incoming audio and if it's below a certain value, we say it's silence. Would be cool to find a specialized algorithm that does this better, like what…
-
Dear colleagues, after install Nemo r1.5.0 I have error in method oracle_model.diarize() in
[Speaker_Diarization_Inference] https://github.com/NVIDIA/NeMo/blob/main/tutorials/speaker_tasks/Speaker_D…