caio-freitas / GraphDiffusionImitate

Diffusion-based graph generative policies for imitation learning in robotics tasks 🧠🤖
MIT License

Start DDPM Policy from last action #102

Closed: caio-freitas closed this issue 8 months ago

caio-freitas commented 8 months ago

Starting the DDPM reverse process from the last observed action, instead of from pure noise, could make learning considerably easier (although it may lead to idling actions).
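One possible reading of this idea, sketched below with NumPy only: forward-diffuse the last action to an intermediate step `t_start` and run the standard DDPM reverse updates from there, rather than from step `T` with Gaussian noise. This is an assumption about how the warm start could be done, not the repository's implementation. The names `make_schedule`, `warm_start`, `sample_action`, `eps_model` and `t_start` are hypothetical, and the linear beta schedule is just a common default.

```python
import numpy as np


def make_schedule(num_steps=100, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule (an assumed default, not taken from the repo)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    return betas, alphas, alpha_bars


def warm_start(last_action, alpha_bars, t_start, rng):
    """Forward-diffuse the last action to step t_start:
    x_t = sqrt(alpha_bar_t) * a_last + sqrt(1 - alpha_bar_t) * eps."""
    if not 0 <= t_start < len(alpha_bars):
        raise ValueError("t_start must index into the schedule")
    noise = rng.standard_normal(last_action.shape)
    a_bar = alpha_bars[t_start]
    return np.sqrt(a_bar) * last_action + np.sqrt(1.0 - a_bar) * noise


def sample_action(eps_model, last_action, t_start, schedule, rng):
    """Run the DDPM reverse updates from t_start down to 0.

    eps_model(x, t) is a hypothetical noise predictor; in practice it would
    be the trained (observation-conditioned) denoising network.
    """
    betas, alphas, alpha_bars = schedule
    x = warm_start(last_action, alpha_bars, t_start, rng)
    for t in range(t_start, -1, -1):
        eps = eps_model(x, t)
        # Posterior mean of the standard DDPM reverse step.
        mean = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        # No noise is added on the final step.
        noise = rng.standard_normal(x.shape) if t > 0 else np.zeros_like(x)
        x = mean + np.sqrt(betas[t]) * noise
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    schedule = make_schedule()
    last_action = np.array([0.1, -0.2, 0.3])  # placeholder action vector

    def dummy_eps(x, t):
        # Stand-in for the trained network; always predicts zero noise.
        return np.zeros_like(x)

    next_action = sample_action(dummy_eps, last_action, t_start=30,
                                schedule=schedule, rng=rng)
    print(next_action.shape)
```

In this sketch `t_start` controls the trade-off: a small value keeps the sample close to the last action, which is presumably where the idling risk would come from, while `t_start = num_steps - 1` is the closest this scheme gets to the usual pure-noise start.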