Why use sub09 for class-based or unconditional generation?

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Apache License 2.0


Why use sub09 for class-based or unconditional generation? #37

Open ludanruan opened 2 years ago

ludanruan commented 2 years ago

Hi, I wonder why you trained on a subset of Speech Commands rather than the whole dataset. In my experiments, the diffusion model cannot handle a dataset at large scale (1M for ~8 hours). Did you try the whole dataset, and if so, what were the generation performance and the training cost?