vincekurtz / rddp

Reward-Driven Diffusion Policy
1 stars 0 forks source link

Improve the training procedure #10

Open vincekurtz opened 3 months ago

vincekurtz commented 3 months ago

The current training loop is fairly efficient, but lacks some features. Eventually we will want: