-
Is it possible to make a CPU version of Controllable TalkNet on Windows? It should be feasible, since someone has already done this on Colab.
Thank you!
-
Observing that this project has no mature data preprocessing and trains directly on raw data, we propose adding a data preprocessing step.
-
Your work is really impressive. In your code, I could not find the SpeechAutoEncoder class. Could you please provide the code for this part and the weights? Many thanks.
-
First of all, thank you for the project. I think it is really useful, especially since the official NVIDIA implementation has not been released yet!
Did you manage to train the model to satisfactory quali…
-
Could you tell me where I can find the relevant implementation?
-
Dear Paveel,
Thanks a lot for releasing the supplementary material at https://arxiv.org/pdf/2203.13086v4.pdf and releasing the architecture code at this repo.
However, I don't find in the Githu…
-
Hi, thank you for sharing your excellent work.
I want to ask about your end-to-end TTS model. In the paper, you stated that only the decoder is changed such that it can generate waveform (by using Wa…
-
See if there is a way to run text to speech and read the generated text out loud.
Ideally, this happens in parallel to the LLM generating tokens (using the [second CPU core](https://coral.ai/docs/d…
-
### OpenVINO Version
2024.1.0
### Operating System
Other (Please specify in description)
### Device used for inference
CPU
### Framework
None
### Model used
Custom (a version of Hifi-GAN)
##…
-
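## Paper title (as in the original)
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems
## In a nutshell
Proposes a new method that generates text and speech in parallel for low-latency spoken dialogue systems, and demonstrates its effectiveness.
### Paper link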
https:…