Closed meric-sakarya closed 4 years ago
Hello,
Why are you not using the zoo?
Please read the documentation to understand the meaning of each variable (n_envs
correspond to the number of environments/workers to collect data). You can also read the code from the zoo train.py
and utils/utils.py
to see how n_envs
is used.
I don't know how to install zoo on Windows and couldn't find documentation on it.
I don't know how to install zoo on Windows and couldn't find documentation on it.
You just need to clone the repo and install python dependencies, it is not specific to windows. You could also use it online using google colab (cf README).
Can't I just use the hyperparameters provided by zoo without actually using zoo? I am not interested in already trained agents and also I am going to need some wrappers in another environment.
Can't I just use the hyperparameters provided by zoo without actually using zoo? I am not interested in already trained agents and also I am going to need some wrappers in another environment.
In SB3 version (https://github.com/DLR-RM/rl-baselines3-zoo), you can download the training scripts without the trained agents.
Here, you can download the repo and then delete the trained_agents
folder.
Otherwise, you can just copy the files.
I managed to get an agent working thanks for the help. But isn't this graph pretty weird? What might be causing this? Can I wrap this with the monitor wrapper to get a better looking graph?
There are already issues about that: https://github.com/hill-a/stable-baselines/issues/143
In short: you should ignore it and use Monitor wrapper or proper evaluation (using EvalCallback
included in the zoo).
Closing this issue as the original questions was answered.
@araffin I added the following line to the ppo2.yml file: env_wrapper: stable_baselines.bench.monitor. It did not work and I received his error when trying to restart the training: Should I create a new issue for this?
I am trying to change the hyperparameters of PPO2 to achieve better results in Pendulum-v0. I have tried using the parameters from RL Baselines Zoo here but I receive this error.
Here is the code:
System Info Describe the characteristic of your environment:
Describe how stable baselines was installed (pip, docker, source, ...): source GPU models and configuration: NVIDIA GTX 1050 with CUDA 10.0 and cuDNN 7.6.5 Python version: 3.7 Tensorflow version: 1.15