Closed caio-freitas closed 8 months ago
Not starting the DDPM from pure noise, but from the last observed action instead, could facilitate a lot learning (although maybe leading to idling actions).
Not starting the DDPM from pure noise, but from the last observed action instead, could facilitate a lot learning (although maybe leading to idling actions).