vincekurtz / rddp

Reward-Driven Diffusion Policy
1 stars 0 forks source link

PPO baseline for the bug-trap #18

Closed vincekurtz closed 2 months ago

vincekurtz commented 2 months ago

Adds a simple PPO baseline for the bug trap env.