-
**Describe the bug**
from cosyvoice.cli.cosyvoice import CosyVoice
from cosyvoice.utils.file_utils import load_wav
import torchaudio
cosyvoice = CosyVoice('pretrained_models/CosyVoice-300M')
…
-
**describe the feature you'd like to see**
Download a specific portion of a given youtube video's audio. This specific part is delimited by start and end.
**describe alternatives you've considered**
…
-
Although getUserMedia is helpful for getting camera input, I'm developing a web app that uses speech recognition. I looked into [Pocketsphinx.js](http://syl22-00.github.io/pocketsphinx.js/), but it's …
-
## Goal
- 8th October demo at our event
- Intended as a server-side demo: learnings to be then applied in https://github.com/janhq/jan/issues/3488 (Sprint 22-23)
## Questions
- What data can we col…
-
When running Gradio cookbook, I am running into this error when trying to execute the very last prompt in the cookbook.
Error message shown in the editor:
`Error Exception: ffmpeg was not found b…
-
Mình gặp phải lỗi mong được hỗ trợ, xin cảm ơn.
Mình đang cài phiên bản mới nhất underthesea v6.8.0 chạy windows 10, Anacoda mới nhất, python 3.11 và cài theo như hướng dẫn:
pip install underthese…
-
同时我单独使用vad推理,利用sounddevice从麦克风读取数据,用以下
```
sample_rate = 16000 # 采样率
channels = 1 # 单声道
dtype = "int16" # 数据类型
blocksize = 1024 # 块大小
def record_audio():
with sd.InputStream(
…
-
### 🐛 Describe the bug
For batch_size > 1, variable-length inputs (e.g. speech, text) are padded in order to construct one batch tensor.
**When this tensor goes through nn.InstanceNorm series (nn.…
-
Disable Tailored Experiences
![Untitled2](https://github.com/user-attachments/assets/450a56b0-b230-4ffa-9f7e-bd81b7ab9679)
[HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows\CurrentVersion\Privacy]
"…
-
Steps to reproduce
------------------
1. I was trying to use an audio output as ``Microphone()``
When I list my microphones with
```python
import speech_recognition as sr
for index, name in…