SebastianLarssonDTU / 02456-Reinforcement-Learning-Project

0 stars 0 forks source link

Run experiments #15

Open DoaFever opened 3 years ago

DoaFever commented 3 years ago
DoaFever commented 3 years ago

I have started: Long run for ppo and impala, (needs to be continued again when done) Impala 20 death penalty 50 levels

DoaFever commented 3 years ago

Priority:

  1. Run with 200 levels: IMPALA, IMPALA DEATH
  2. Run Framestacking 50 levels: IMPALA, IMPALA DEATH. (framestacking 2, 32 env og 4, 16 env)
  3. Run Impala death pen 50 levels for long (16M timesteps)!