-
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
This dataset is built for the purpose of emotional speech synthesis. The transcripts were …
-
Hello,
First, I apologize if this is not the proper channel to ask about your paper "Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens" by Rafael …
-
# 🌟 New model addition
## Model description
**What type of model is FastPitch 1.1?**
It is a mel-spectrogram generator (part of a text-to-speech engine) that mainly comprises two F…
-
Hi !
Thanks for this great implementation !
I'm a speech scientist, and I'm not an expert in neural networks. I'm working on expressive speech synthesis, and I'm currently wondering about the po…
-
Hi,
following DeepMind's paper "Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron", the authors claim to use GMM attention because it generalizes better to longer utterances. I …
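For context, the GMM attention referenced here (in the style of Graves' 2013 handwriting-synthesis attention) places a mixture of Gaussians over encoder positions whose means can only move forward, which is what makes it robust on long utterances. Below is a minimal, illustrative NumPy sketch of a single decoder step; the parameter names and the linear-layer outputs (`kappa_hat`, `beta_hat`, `w_hat`) are hypothetical stand-ins, not any specific repository's API.

```python
import numpy as np

def gmm_attention_step(prev_kappa, kappa_hat, beta_hat, w_hat, num_enc_steps):
    """One decoder step of Graves-style GMM attention (illustrative sketch).

    prev_kappa, kappa_hat, beta_hat, w_hat: arrays of shape (K,) for K mixture
    components; in a real model these would come from a linear layer on the
    decoder state. The means advance monotonically, which is why this form of
    attention tends to generalize to utterances longer than those seen in
    training.
    """
    kappa = prev_kappa + np.exp(kappa_hat)    # means move strictly forward
    beta = np.exp(beta_hat)                   # positive component widths
    w = np.exp(w_hat) / np.exp(w_hat).sum()   # normalized mixture weights
    j = np.arange(num_enc_steps)[None, :]     # encoder positions, shape (1, T)
    # Sum of K Gaussians evaluated at every encoder position j:
    phi = (w[:, None] * np.exp(-beta[:, None] * (kappa[:, None] - j) ** 2)).sum(axis=0)
    return kappa, phi                          # phi: attention over T positions

# Toy usage: two components starting at position 0, advancing by exp(0) = 1.
kappa, phi = gmm_attention_step(
    prev_kappa=np.zeros(2), kappa_hat=np.zeros(2),
    beta_hat=np.zeros(2), w_hat=np.zeros(2), num_enc_steps=10)
print(phi.shape)  # (10,)
```

Because `kappa` is a cumulative sum of positive increments, the attention window cannot jump backwards, unlike content-based attention.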
-
Hello everyone,
I am experimenting with training WaveGlow on a dataset of singing voice. The loss optimizes without noticeable issues, although it hardly gets lower than around -4 (for …
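One detail worth noting for readers puzzled by the sign: WaveGlow's loss is a negative log-likelihood under a continuous density (plus log-determinant terms), and continuous densities can exceed 1, so a loss that plateaus at a negative value is not by itself a sign of a bug. A quick numeric illustration with a Gaussian (not WaveGlow's actual objective, just the underlying principle):

```python
import numpy as np

def gaussian_nll(x, sigma):
    """Per-sample negative log-likelihood under a zero-mean Gaussian."""
    return 0.5 * np.log(2 * np.pi * sigma**2) + x**2 / (2 * sigma**2)

# Low-variance data: a well-matched density assigns values p(x) > 1,
# so the average NLL goes negative.
x = np.random.default_rng(0).normal(0.0, 0.05, size=10000)
print(gaussian_nll(x, 1.0).mean())   # poorly matched scale: ~0.92, positive
print(gaussian_nll(x, 0.05).mean())  # well matched scale: negative
```

How low the plateau can go is bounded by the data's effective entropy, which differs between speech and singing corpora.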
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
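One building block behind the audio transforms such a library provides (e.g. mel-spectrogram computation) is the mel frequency scale. As a self-contained illustration of the idea, here is the HTK-style Hz/mel conversion in plain NumPy; this is a sketch of the underlying math, not `torchaudio`'s implementation:

```python
import numpy as np

def hz_to_mel(f):
    """HTK-style Hz -> mel conversion."""
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

# Center frequencies of an 8-band filterbank spaced evenly on the mel scale
# up to 8 kHz; note the spacing widens at higher frequencies.
centers = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(8000.0), 8))
print(np.round(centers).astype(int))
```

Even spacing in mel corresponds to roughly logarithmic spacing in Hz, which is why mel-spectrogram front ends allocate finer resolution to low frequencies.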
-
# NVIDIA NeMo (ByT5 G2P and G2P-Conformer):
> NVIDIA NeMo provides grapheme-to-phoneme models for various languages, including **German**.
> The ByT5 G2P model is based on a neural network and can…
-
Implement an opcode or opcodes for vocal singing synthesis in Csound, inspired by Vocaloid, Sinsy, and such.
The opcode should take marked up text and some form of vocal model, and synthesize a music…
-
Thanks for your great work, but I found that if I set the hyperparameter `use_gst=False` and run, the behavior seemed different from my understanding of Tacotron 1. The relevant part of tacotron.py is excerpted here.
```pyth…