Closed CAI23sbP closed 1 week ago
Hello,
There is not exist about how to use gsde in docs. Could you explain?
Have you read the gSDE paper? What did you understand and what is not clear? Did you have a look at https://stable-baselines3.readthedocs.io/en/master/modules/ppo.html#ppo-policies ?
use_sde (bool) – Whether to use State Dependent Exploration or not
log_std_init (float) – Initial value for the log standard deviation
full_std (bool) – Whether to use (n_features x n_actions) parameters for the std instead of only (n_features,) when using gSDE
use_expln (bool) – Use expln() function instead of exp() to ensure a positive standard deviation (cf paper). It allows to keep variance above zero and prevent it from growing too fast. In practice, exp() is usually enough.
@araffin Sorry to late reply!. Yes, i read docs. i mean that "Is it okay to use it like this?" about example code.
Yes, i read docs. i mean that "Is it okay to use it like this?" about example code.
You mean "will it run"? yes "is it tailored for my problem?" hard to say, but at least those parameters were found to be working for other envs, you can have a look (and use) the RL Zoo for that: https://github.com/DLR-RM/rl-baselines3-zoo/blob/27e081eb24419ee843ae1c329b0482db823c9fc1/hyperparams/ppo.yml#L137
Thank you for your reply~!
❓ Question
@araffin There is not exist about how to use gsde in docs. Could you explain? for me about PPO, p.s.: my stable-baselines3 version is v2.0.0
use_gsde = True, full_std= True, log_std_init = -2, sde_sample_freq = 4
Checklist