-
I found quite frustrating while I trying to use the TTS combine with azure 's ASR . For some reason, The TTS output was received by ASR incorrectly even if I mute the microphone with Pyaudio. So pleas…
-
Hello, thanks for the great work, it was interesting! Can you please tell me more about how you align the speech space with the text vocabulary? As far as I understand, you use 200 centroids from the …
-
### Ticket Contents
Develop connectors for AWS and GCP speech & translation connectors.
### Goals
To provide support for the speech and translations of data for cloud providers such as GCP an…
-
Thank you for your hard work!
Is there any zeroshot tts samples? Also, Could it run with cross lingual data?
-
Hi, thank you so much for the amazing repo—it's really very cool!
I'm trying to add a new language, but I encountered an issue with IPA symbols. Specifically, 5 letters are missing. I checked symbo…
lpscr updated
2 weeks ago
-
使用的版本号 0.8.1
转写时实时日志框一直不动
以下是faster-whisper报错日志信息
`None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilitie…
-
I am trying to use azure avatar with python.
It works fine in a basic mode, but fails in a chat mode.
After I have inputed some voice I see it on screen and I am getting:
```
Result id for avatar…
-
I am very interested in your project. I am looking for an audio streaming transcripting method for language learning.
Long story short, i set large-v3 as model and choosen my audio_device: Headset …
-
Hello @aluminumbox , I continued training the `llm` model on a German dataset (300 hours), but after 25k steps the model could not pronounce German and the 5 available languages.
My process:
- I f…
-
## Goal
See: #56
## Methodology
- Encoder: WhisperVQ (audio to semantic tokens) with lastest v3 checkpoint supporting 7 more languages.
- Significant Dataset improve:
+ Scaling Speech Instr…