-
Thank you for sharing such nice repo!
CUDA and CPU now generate different audio (tone) when given same text.
How to make CUDA and CPU generate the same speech (tone) when given same text?
-
**Describe the bug**
跨语种复制模式下从日语到中文会出现粤语输出
For Title , Cantonese output appears from Japanese to Chinese in cross-language copying mode
**Reapped**
1. Get some pure human voice sets of Japa…
-
Changes to be made
1. Image generation required
2. Meta title and description to be added
3. Audio (Content Text to audio)
4. Content Copy option
…
-
### 确认清单
- [X] 我已经阅读过 README.md 和 dependencies.md 文件
- [X] 我已经确认之前没有 issue 或 discussion 涉及此 BUG
- [X] 我已经确认问题发生在最新代码或稳定版本中
- [X] 我已经确认问题与 API 无关
- [X] 我已经确认问题与 WebUI 无关
- [X] 我已经确认问题与 Finetune 无关
##…
-
Add the ability for users to provide audio input for their performance reviews and self-reviews. Previously, users had to type their input, but now they can record their audio. You can use streamlit a…
-
## Description
When using the WebSocket transcription endpoint `/v1/audio/transcriptions`, the server responds with duplicate transcriptions for a single audio input. This occurs consistently for e…
-
### Description
Any tutorial that imports test audio files (e.g. `Audio.from_filepath("../src/tests/data_for_testing/audio_48khz_mono_16bits.wav")`) do not work on Google Colab, as there is no audio …
-
## Description
We are experiencing intermittent failures in audio recognition when using the Google Speech-to-Text v1 library for real-time transcription in our .NET project. The issue manifests as f…
-
```
INFO: Started server process [51254]
INFO: Waiting for application startup.
INFO: Application startup complete.
INFO: Uvicorn running on http://0.0.0.0:10101 (Press CTRL+C to q…
-
Sending text from phone to laptop works, but I get the following error when trying to copy the text to clipboard:
![image](https://github.com/user-attachments/assets/8d894058-cd99-4591-b742-dd20308…