Armandpl / furuta

Building and Training a Rotary Inverted Pendulum robot
18 stars 4 forks

try out dreamer #62

Open Armandpl opened 8 months ago

Armandpl commented 8 months ago
Armandpl commented 7 months ago

dreamer is especially interesting here because the dynamics of the system are probably trivial to learn, and it's costly to collect steps on the real robot

Armandpl commented 7 months ago

in the same vein, we could learn a neural sim, maybe using a GPT-style model like comma is doing? check out karpathy's videos. having a good neural sim could be useful to tune/select good controllers, instead of hoping for RL agent convergence
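
a rough sketch of the "learn a sim, then use it to pick a controller" idea, just to make it concrete. everything here is an assumption for illustration: the "robot" is a toy scalar linear system (not the furuta dynamics), the learned model is a least-squares fit rather than a neural net or GPT, and the names (`real_step`, `fit_linear_model`, `rollout_cost`) are hypothetical and not from this repo.

```python
import random

# Toy stand-in for the real robot: an unstable scalar linear system.
# These true parameters are only used to generate data; the learner never sees them.
TRUE_A, TRUE_B = 1.1, 0.5


def real_step(x, u):
    """One step of the 'real' system (placeholder, not the Furuta pendulum)."""
    return TRUE_A * x + TRUE_B * u


def collect_transitions(n, rng):
    """Collect (x, u, x_next) tuples from random states and actions."""
    data = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        u = rng.uniform(-1.0, 1.0)
        data.append((x, u, real_step(x, u)))
    return data


def fit_linear_model(data):
    """Least-squares fit of x_next ~ a*x + b*u using the 2x2 normal equations."""
    sxx = sum(x * x for x, u, y in data)
    sxu = sum(x * u for x, u, y in data)
    suu = sum(u * u for x, u, y in data)
    sxy = sum(x * y for x, u, y in data)
    suy = sum(u * y for x, u, y in data)
    det = sxx * suu - sxu * sxu
    a = (sxy * suu - suy * sxu) / det
    b = (suy * sxx - sxy * sxu) / det
    return a, b


def rollout_cost(a, b, gain, x0=1.0, horizon=50):
    """Quadratic cost of the controller u = -gain * x, rolled out in the learned model."""
    x, cost = x0, 0.0
    for _ in range(horizon):
        u = -gain * x
        cost += x * x + 0.1 * u * u
        x = a * x + b * u
    return cost


rng = random.Random(0)
a_hat, b_hat = fit_linear_model(collect_transitions(200, rng))

# Rank candidate controllers entirely inside the learned model, no extra robot steps.
candidate_gains = [0.0, 0.5, 1.0, 2.0, 3.0]
best_gain = min(candidate_gains, key=lambda k: rollout_cost(a_hat, b_hat, k))
print(a_hat, b_hat, best_gain)
```

the point is only the workflow: collect a small batch of real transitions once, fit a model, then compare controllers in the model. whether a learned model of the actual pendulum would be accurate enough for this is an open question.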