NVIDIA mellotron issues

NVIDIA / mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

BSD 3-Clause "New" or "Revised" License

853 stars 187 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Replace split word_tokenize to deal with punctuations.

#75 sungjae-cho closed 3 years ago
14
How to choose a target speaker for generating voice

#74 deepuvikraman closed 3 years ago
5
modules.py: add input lengths processing to ReferenceEncoder

#73 hubeibei007 closed 4 years ago
1
why ignore speaker embedding?

#72 zzw922cn opened 4 years ago
2
ReferenceEncoder did not use the actual mel lengths

#71 hubeibei007 closed 4 years ago
2
Question about the gender of the Libri dataset used by Mellotron mention in the published article

#70 TitiAffandi opened 4 years ago
1
require large memory

#69 aijianiula0601 opened 4 years ago
3
Speaker order is randomised while loading - why?

#68 karkirowle closed 4 years ago
1
Difference between CMUDict of None?

#67 lqniunjunlper opened 4 years ago
1
Inference failed

#66 lqniunjunlper opened 4 years ago
1
Inference bug?

#65 lqniunjunlper closed 4 years ago
1
Style not being applied

#64 kannadaraj opened 4 years ago
4
Gradient overflow with Mixed Precision Training

#63 MinHyung-Kang opened 4 years ago
1
Using own text to generate speech using Mellotron

#62 astricks closed 4 years ago
8
Adding another speaker

#61 JakubReha closed 4 years ago
5
Question about custom dataset

#60 LucasRotsen closed 4 years ago
2
The speaker ids are misaligned in inference.ipynb

#59 pneumoman closed 4 years ago
7
[QUESTION] About lyrics hyphenation

#58 loretoparisi closed 4 years ago
5
Generic Text-to-Speech Inference

#57 GreenGarnets opened 4 years ago
1
Funky warble on sustained sung notes from a MUSICXML file

#56 josharmenta opened 4 years ago
2
Information about p_teacher_forcing hyperparameter

#55 paarthneekhara closed 4 years ago
3
Synthesized voice does not correspond to the speaker id

#54 paarthneekhara closed 4 years ago
4
Singing Voice from Music Score

#53 Sangkikim-77 closed 4 years ago
4
Inference troubles on Windows

#52 camjac251 opened 4 years ago
21
Docker

#51 pneumoman opened 4 years ago
0
CUDA out of memory while running inference file

#50 pathakmukul closed 4 years ago
7
Windows Anaconda

#49 camjac251 opened 4 years ago
5
How to fine tune a new voice using pretrained model

#48 mathigatti opened 4 years ago
3
Warning Message in yin

#47 pneumoman closed 2 years ago
5
synthesized speaker quality changed

#46 kannadaraj opened 4 years ago
5
The effect of passing in the original MEL seems minimal

#45 pneumoman closed 3 years ago
3
Dimensions mismatch when using pretrained model

#44 mathigatti closed 4 years ago
1
Question regarding paper

#43 hash2430 opened 4 years ago
4
Pitch contour not being applied

#42 tebin closed 4 years ago
2
Unable to reproduce decent quality generated audio with training data samples

#41 rohanbadlani closed 4 years ago
1
Installation issues on Ubuntu 18.04

#40 loretoparisi opened 4 years ago
1
Adaptation

#39 karkirowle opened 4 years ago
5
Singing voice from audio signal

#38 tebin closed 4 years ago
3
Training DB for Waveglow pretrained model in this repo

#37 hash2430 closed 4 years ago
3
No of training iterations of pretrained model

#36 alexvioni opened 4 years ago
1
build(deps): bump tensorflow from 1.15 to 1.15.2

#35 dependabot[bot] closed 4 years ago
0
inference_noattention for new sequences

#34 texpomru13 opened 4 years ago
3
Correction for text with punctuation and dash

#33 hyunjoolee opened 4 years ago
0
LJS trained model?

#32 scottbouma closed 4 years ago
2
The distinction between different speaker with mandarin dataset is not obvious.

#31 chynphh opened 4 years ago
2
Cannot resume training without quality loss

#30 AndroYD84 closed 4 years ago
5
how many epochs to get good results for libritts clean 100?

#29 xilaili opened 4 years ago
0
How to fix problematic words with MusicXML parser?

#28 AndroYD84 opened 4 years ago
7
Fixed CMUDict and ARPAbet conversion (p_arpabet is currently not used)

#27 xDuck closed 4 years ago
1
hparams trainings settings

#26 peter1000 closed 4 years ago
2

Previous Next