Open Armandpl opened 8 months ago
dreamer is especially intersting here bc the dynamics of the system are probably trivial to learn + it's costly to collect steps on the real robot
in the same vibe we could learn a neural sim, maybe use a gpt model like comma is doing? check out karpathy's videos having a good neural sim could be useful to tune/select good controllers, instead of hoping for rl agent convergence