Open flrngel opened 6 years ago
https://arxiv.org/abs/1802.06006 Paper from Baidu Research
Paper will do
Paper Notations
Paper avoids mode collapse with training speaker encoder seperately
Because human is so expensive, paper propose those two solutions for evaluation
https://arxiv.org/abs/1802.06006 Paper from Baidu Research
Abstract
Paper will do
1. Introduction
2. Voice Cloning
Paper Notations
2.1. Speaker adaption
Speaker adaption function
2.2. Speaker encoding
Speaker encoding function
Paper avoids mode collapse with training speaker encoder seperately
Loss function (L1)
Architecture
2.3. Discriminative models for evaluation
Because human is so expensive, paper propose those two solutions for evaluation
2.3.1. Speaker Classification
2.3.2. Speaker Verification
Experiments
3.1. Datasets