Closed sglucas closed 8 months ago
The code link is correct. If you look at the summary of the tracked experiment, the env_id
is BigFishEasy-v0
, which is one of the procgen’s environments. As pointed out in #340, EnvPool >=0.8.1 introduces procgen environments, basically allowing us to handle Atari and procgen using the same codebase. There are minor API differences, though, if you try to diff the code and what was used to handle Atari at #338 (mostly minor API differences). Also note that there are differences in hyperparameters.
A good way is to probably rename #338's implementation as ppo_impalacnn_jax_scan.py
which handles both Atari and procgen environment. Could you give it try and make a PR based on #338? Happy to provide more context and info.
Hi @vwxyzjn do you try to use this link https://github.com/bmazoure/ppo_jax. I find the current code does not test the learned policy in all 1000 environments.
Thanks a lot!
https://github.com/vwxyzjn/cleanba should work with procgen if you want to take a look :)
Thank you very much for your contribution.
May I ask if is it possible to release the PPO+Procgen code based on Jax?
Best