DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
https://rl-baselines3-zoo.readthedocs.io
MIT License
2.01k stars 510 forks source link

Fix minigrid policy type and add new envs #370

Closed qgallouedec closed 1 year ago

qgallouedec commented 1 year ago

Description

Motivation and Context

Types of changes

Checklist:

Note: we are using a maximum length of 127 characters per line

qgallouedec commented 1 year ago

@araffin why does minigrid need gym-packages arg? If there is no specific reason, I would use import_envs.py, like for other libraries.

araffin commented 1 year ago

@araffin why does minigrid need gym-packages arg? If there is no specific reason, I would use import_envs.py, like for other libraries.

mainly for legacy reasons and to demonstrate how to use --gym-packages.

qgallouedec commented 1 year ago

Would you agree to remove it?

araffin commented 1 year ago

Would you agree to remove it?

Remove the argument completely, no. It is actually useful from time to time (even though the feature is now mostly covered by python config now), and we should probably rename it to packages.

Change the way minigrid is used? yes, but then we should probably add a test for the gym-packages to avoid regressions.

qgallouedec commented 1 year ago

I mean, changing the way that minigrid is used, sorry I agree with your remark about testing ˋgym-packages`

qgallouedec commented 1 year ago

https://wandb.ai/qgallouedec/sb3-minigrid

qgallouedec commented 1 year ago

I just wait to have one run per env (and maybe adjust n_timesteps) before merging

qgallouedec commented 1 year ago

I think that we're good now

qgallouedec commented 1 year ago

Note that I did not run the benchmark script. It fails during the evaluation Pendulum/PPO, but I did not investigate further.

araffin commented 1 year ago

fyi, I removed the need for installing test env package and custom mypy arg in https://github.com/DLR-RM/rl-baselines3-zoo/pull/357/commits/666abb320ef311c99726b8448cf03b08c7c2dbc1

qgallouedec commented 1 year ago

It's much better like this