-
Could you add native speech-to-speech / audio-to-audio support, with an encoder (tokenizer) and a decoder (back to audio waves)?
I was able to implement a decoder-only model; I first used an audio codec to…
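As a rough sketch of what such an encoder/decoder pair looks like at the interface level: the "encoder" maps fixed-size audio frames to discrete token ids by nearest-neighbour vector quantization, and the "decoder" maps ids back to frames. The codebook here is a toy, hand-written stand-in; real neural codecs (e.g. EnCodec, SoundStream) learn residual VQ codebooks, so treat this only as an illustration of the token interface, not the actual method.

```python
FRAME = 4  # samples per frame (toy value; real codecs use much larger hops)

def encode(wave, codebook):
    """Map each frame of `wave` to the id of the closest codebook vector."""
    tokens = []
    for i in range(0, len(wave) - FRAME + 1, FRAME):
        frame = wave[i:i + FRAME]
        best = min(
            range(len(codebook)),
            key=lambda k: sum((a - b) ** 2 for a, b in zip(frame, codebook[k])),
        )
        tokens.append(best)
    return tokens

def decode(tokens, codebook):
    """Concatenate codebook vectors back into a waveform."""
    wave = []
    for t in tokens:
        wave.extend(codebook[t])
    return wave

# Toy codebook: silence, positive pulse, negative pulse.
codebook = [
    [0.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0, 1.0],
    [-1.0, -1.0, -1.0, -1.0],
]
wave = [0.1, -0.1, 0.0, 0.05, 0.9, 1.1, 0.95, 1.0]
tokens = encode(wave, codebook)          # [0, 1]
restored = decode(tokens, codebook)      # quantized reconstruction
```

A decoder-only LM then simply predicts the next token id in `tokens`, and the codec's decoder turns the generated ids back into a waveform.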
-
I've been trying to set up a speech model on an Xavier NX, and I've been able to get Tacotron2/WaveGlow running; however, the models use quite a lot of memory. I've been looking to use…
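For context, a quick back-of-envelope estimate shows why the weights alone are heavy on a Jetson-class board, and why fp16 inference is usually the first mitigation. The parameter counts below are approximate, commonly quoted figures (Tacotron2 ≈ 28M, WaveGlow ≈ 88M); substitute the counts from your own checkpoints.

```python
def model_memory_mb(num_params, bytes_per_param):
    """Memory for the weights alone, ignoring activations and workspace."""
    return num_params * bytes_per_param / (1024 ** 2)

tacotron2 = 28_000_000   # approximate public figure
waveglow = 88_000_000    # approximate public figure

fp32 = model_memory_mb(tacotron2 + waveglow, 4)  # ~440 MB of weights
fp16 = model_memory_mb(tacotron2 + waveglow, 2)  # halved under fp16
```

Activations, CUDA context, and framework overhead come on top of this, which is why smaller vocoders or TensorRT engines are popular on the Xavier NX.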
-
**IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:**
- Speech SDK log taken from a run that exhibits the reported issue.
See [instructions on how to take logs](https://docs.microsoft.com/azu…
-
Hello everyone,
I am currently trying to use OpenVoice for German language generation. I have not been able to figure out how this zero-shot speech synthesis is supposed to work. Is there some kind of multila…
-
In addVisemeReceivedEventHandler, I receive event.animation. I want to use Viseme 3D Blend Shapes to drive my 3D Avatar.
Here is an example JSON:
{
  "FrameIndex": 0,
  "BlendShapes": [
    …
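A minimal sketch of consuming that payload: each `VisemeReceived` event carries a JSON chunk whose `BlendShapes` field is a list of frames, each frame a list of blend-shape weights (per the Azure documentation, 55 facial positions at 60 FPS). `apply_to_avatar` below is a hypothetical hook standing in for whatever your 3D engine exposes.

```python
import json

def handle_viseme_animation(animation_json, apply_to_avatar):
    """Feed each blend-shape frame of one animation chunk to the avatar."""
    data = json.loads(animation_json)
    start = data["FrameIndex"]  # offset of this chunk within the utterance
    for i, weights in enumerate(data["BlendShapes"]):
        apply_to_avatar(frame=start + i, weights=weights)

# Tiny two-frame example (real frames carry many more weights).
sample = '{"FrameIndex": 0, "BlendShapes": [[0.0, 0.1], [0.0, 0.2]]}'
frames = []
handle_viseme_animation(sample, lambda frame, weights: frames.append((frame, weights)))
# frames == [(0, [0.0, 0.1]), (1, [0.0, 0.2])]
```

Your engine then maps each weight to the corresponding blend-shape channel on the avatar's face rig at the frame rate the service specifies.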
-
Hello, could you please help me understand the motivation for inserting blank IDs between the input IPA IDs? The implementation can be found in text_mel_datamodule.py, line 216:
def get_text(sel…
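For readers unfamiliar with the pattern: the shape of the operation (if not its motivation) can be sketched as below. A blank id is inserted between, and around, every phone id, so n phones become 2n + 1 tokens. This helper follows the Glow-TTS/VITS-style `intersperse`; the commonly given rationale is that the blanks give the non-autoregressive encoder explicit positions to model transitions between phones, which reportedly improves alignment and pronunciation.

```python
def intersperse(seq, item):
    """Return seq with `item` inserted before, between, and after elements."""
    result = [item] * (len(seq) * 2 + 1)
    result[1::2] = seq
    return result

intersperse([5, 9, 7], 0)  # [0, 5, 0, 9, 0, 7, 0]
```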
-
**Describe the bug**
A subset of the voice models appears to have difficulty processing the three special characters: `` and `&`, even when using entity format (https://learn.microsoft.com/en-us/azur…
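As a workaround sketch while the bug stands: escape the reserved XML characters before embedding user text in SSML, so the service receives proper entities (`&amp;`, `&lt;`, `&gt;`) rather than raw characters. Python's standard library covers this directly:

```python
from xml.sax.saxutils import escape

def to_ssml_text(raw):
    """Escape &, <, > so the text is safe inside an SSML document."""
    return escape(raw)

to_ssml_text("AT&T <research>")  # 'AT&amp;T &lt;research&gt;'
```

Whether a given voice then renders the entity correctly is, per the report above, still model-dependent.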
-
Thanks for your great work. Recently, I have been using the hidden_state output from a large language model as the input to the matcha_tts encoder for training. I have fit a sample tens of thousands of time…
-
The Gradio app displays:
"MetaVoice-1B is a 1.2B parameter base model for TTS (text-to-speech). It has been built with the following priorities:
- Support for long-form synthesis.
![i…
-
I’m not sure if anyone noticed, [but there is a swift-native implementation of Piper](https://github.com/IhorShevchuk/piper-ios-app) that allows it to run on iOS -- and to have Piper models be used as…