Armandpl / dreamerv3

DreamerV3 + gSDE, using pytorch, on a real robot
1 stars 0 forks source link

5 implement reinforce #7

Closed Armandpl closed 7 months ago

Armandpl commented 7 months ago

ok seems like we can solve cartpole, let's see if we can solve minatar or reveal some new bugs https://wandb.ai/armandpl/minidream_dev/runs/l6970cbc/workspace?workspace=user-armandpl