google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
https://github.com/google/dopamine
Apache License 2.0
10.42k stars 1.36k forks source link

Getting 7 as action for a game with 3 actions #170

Open arjung128 opened 3 years ago

arjung128 commented 3 years ago

I have been trying to train an online agent on the environment FreewayNoFrameskip-v4. Because this gym environment is not deterministic, I seeded the environment. Specifically, in atari_lib.py, I added

No other changes were made. I then used the Batch RL codebase to train an online agent.

In all of training, there was one instance of a 7 stored as the action (specifically the last action at the end of five iterations of training), even though Freeway only has three actions. All other stored actions were {0, 1, 2}. Any ideas what could be the cause of this? Going in and changing this one 7 to the most common action isn't a problem, but if this problem arises repeatedly, and for other games, it could be difficult to deal with.

psc-g commented 3 years ago

hi, is this still an issue? are you sure you're reloading the correct checkpoint (for the same game)? otherwise, it seems like it might make more sense to ask in the batch rl repo?