feldberlin / wavenet

An unconditioned Wavenet implementation with fast generation.
3 stars 0 forks source link

Model averaging #24

Open purzelrakete opened 3 years ago

purzelrakete commented 3 years ago

What

Try model averaging. This is reported to significantly increase the test performance of generative models:

  1. Ployak averaging, applied only at test time. Train as per usual.
  2. Stochastic weight averaging. See here for pytorch implementation.

Hypothesis

The performance of the model at test time will be much better with an averaged model. People who KNOW say as much.

Results

Write up the results of your experiment once it has completed and has been analysed. Include links to the treatment run, and also to the baseline if appropriate.

Acceptance Criteria