This PR adds an inference dataloader and a simple collator for inference. The collator passes the file name through, so we can easily find the originals for benchmarking.
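A minimal sketch of what such a collator can look like (the function name `inference_collate`, the `(waveform, filename)` item layout, and the right-padding strategy are assumptions for illustration, not the PR's actual code):

```python
import torch
import torch.nn.functional as F


def inference_collate(batch):
    """Pad variable-length waveforms to the batch max and keep filenames.

    Each item is assumed to be a (waveform_tensor, filename) pair; the
    filename is passed through unchanged so generated audio can later be
    matched to its original for benchmarking.
    """
    waveforms, names = zip(*batch)
    max_len = max(w.shape[-1] for w in waveforms)
    # Right-pad every waveform with zeros up to the longest one in the batch.
    padded = torch.stack(
        [F.pad(w, (0, max_len - w.shape[-1])) for w in waveforms]
    )
    return padded, list(names)
```

Used as `DataLoader(dataset, batch_size=..., collate_fn=inference_collate)`, this yields a `(batch, filenames)` pair per step.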
It also fixes `zero_padding`, potentially. However, it is still unused during generation, because the code in `LitDiffWaveModel.forward()` is not batchable right now.

Note: generating the ~650 audio files will take about 14 hours on GPU.