-
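Longer hotwords (e.g. four characters or more) prevent the ncnn streaming speech recognition model from working properly. For example, in python-api-examples/speech-recognition-from-microphone-with-endpoint-detection.py, adding hotwords_file=r"\sherpa-ncnn-streaming-zipformer-bilingual-zh-en-2023-02-13\hotwo…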
-
Investigate whether sample_rate is causing problems or needs to be hardcoded according to device
Socket timeout
-
Hi,
I’m currently using RealtimeSTT with the following configuration:
```
recorder_config = {
    'spinner': False,
    'model': 'large-v2',
    'language': 'en',
    'silero_sensitivity': …
```
zbeb updated
3 months ago
-
A dedicated mode for fluent conversation. See Google, Microsoft or Apple apps.
- interface to display running conversation
- side by side, or mirrored halves of screen?
- a switch for automati…
-
### Severity
Major
### Versions
18.17.1
### Components/Modules
talkdetect / dsp
### Operating Environment
all
### Frequency of Occurrence
Occasional
### Issue Description
talk_detect signal…
-
Instead of pushing all predictions back into datasets, only do this for *confident* ones, e.g.:
* minimum threshold for score/probability
* entropy based (Eibe?)
Use as parameter in job templates…
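A minimal sketch of the two filtering ideas listed above, assuming each prediction comes with a list of class probabilities. The function names (`entropy`, `is_confident`, `select_confident`) and the default thresholds are hypothetical and chosen only for illustration; they are not taken from the project.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def is_confident(probs, min_score=0.9, max_entropy=None):
    """Return True if a prediction passes the confidence checks.

    min_score:   minimum probability required for the top class.
    max_entropy: optional upper bound on the distribution's entropy;
                 when None, the entropy check is skipped.
    """
    if max(probs) < min_score:
        return False
    if max_entropy is not None and entropy(probs) > max_entropy:
        return False
    return True


def select_confident(predictions, min_score=0.9, max_entropy=None):
    """Keep only (item, probs) pairs that pass is_confident."""
    return [
        (item, probs)
        for item, probs in predictions
        if is_confident(probs, min_score, max_entropy)
    ]
```

Both thresholds are plain keyword arguments, so they could be exposed as parameters of a job template; only the predictions returned by `select_confident` would then be pushed back into the dataset.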
-
Hi,
I would like to request adding an interruption bool in updateSession. The expected behavior would be to toggle (on/off) the interruption mechanism of the assistant when the user speaks over them.…
-
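Thank you so much for this fantastic work! For users overseas, however, the latency of these China-based services is too high; the OCR service in particular takes around 10 seconds almost every time. I hope you can add Microsoft's Azure suite and the AWS suite, and fill out the rest of the Google suite.
## Microsoft Azure suite
**Text translation**
Docs: https://docs.microsoft.com/en-us/azure/cognitive-services/translat…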
zenof updated
2 years ago
-
Or without using their API or console?
-
Hi @felixkreuk, first, thank you for open-sourcing such a good repo on unsupervised phoneme segmentation. Recently, I conducted several experiments on the SpeechOcean 762 dataset, which is a standard speech s…