vincekurtz / rddp

Reward-Driven Diffusion Policy
1 stars 0 forks source link

Condition policy on output, not state #13

Closed vincekurtz closed 3 months ago

vincekurtz commented 3 months ago

Resolves #12.

Also updates the pendulum example accordingly.