Start DDPM Policy from last action

caio-freitas / GraphDiffusionImitate

Diffusion-based graph generative policies for imitation learning in robotics tasks 🧠🤖

MIT License


Start DDPM Policy from last action #102

Closed caio-freitas closed 8 months ago

caio-freitas commented 8 months ago

Starting the DDPM reverse process from the last observed action, instead of from pure noise, could make learning considerably easier (although it might lead to idling actions).

An appropriate noise schedule has to be chosen here, so that enough noise is added for the model to still generalize.
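A minimal sketch of what this could look like, not the repository's implementation: the last action is first noised up to an intermediate step `t_start` with the standard DDPM forward process, and the usual reverse steps are then run from there. The names `warm_start_sample`, `eps_model`, `t_start` and the linear beta schedule are all assumptions made for illustration; `eps_model(x, t)` stands in for whatever noise-prediction network the policy uses.

```python
import numpy as np


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=2e-2):
    """Hypothetical linear noise schedule; any other schedule could be swapped in."""
    return np.linspace(beta_start, beta_end, num_steps)


def warm_start_sample(eps_model, last_action, betas, t_start, rng):
    """Run DDPM reverse steps starting from a noised copy of the last action.

    eps_model: callable (x, t) -> predicted noise, same shape as x (assumed interface).
    t_start:   index of the diffusion step to start from (0 <= t_start < len(betas)).
    """
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)

    # Forward process q(x_t | x_0): noise the last action up to step t_start.
    noise = rng.standard_normal(last_action.shape)
    x = (np.sqrt(alpha_bars[t_start]) * last_action
         + np.sqrt(1.0 - alpha_bars[t_start]) * noise)

    # Standard DDPM ancestral sampling from t_start down to 0.
    for t in range(t_start, -1, -1):
        eps_hat = eps_model(x, t)
        mean = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_hat) / np.sqrt(alphas[t])
        if t > 0:
            x = mean + np.sqrt(betas[t]) * rng.standard_normal(x.shape)
        else:
            x = mean
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    betas = linear_beta_schedule(100)
    last_action = np.zeros(7)  # e.g. a 7-DoF joint command (illustrative)
    dummy_model = lambda x, t: np.zeros_like(x)  # placeholder for a trained network
    action = warm_start_sample(dummy_model, last_action, betas, t_start=30, rng=rng)
    print(action.shape)
```

In this sketch `t_start` is the knob that trades off the two effects mentioned above: a small `t_start` keeps the sample close to the last action (and so risks idling), while `t_start = len(betas) - 1` is close to the usual start from pure noise.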