Closed nanbaima closed 4 years ago
By trial and error I could observe, by felling not scientific :), few things in my implementation:
I could observe that it has to be like the num train per loop >>> n_exploração (exploration should be much higher than train per loop)
Here are the hyper parameters batch_rl_algorithm.py
I hope I can help anyone, and I could make any thing clear...
Yup! And to clarify, the num_trains_per_train_loop
is per train loop, but typically the number of train loops per epoch is 1, so this is effectively the same.
Hi Vitchyr,
If I may, I would kindly like to ask you an question related to: https://github.com/vitchyr/rlkit/blob/1d469a509b797ca04a39b8734c1816ca7d108fc8/examples/sac.py#L92-L96
About:
I did read the SAC paper, however didn't find any specificities such as,
num_eval_steps_per_epoch
, ormin_num_steps_before_training
, so I got a little lost.I would kindly appreciate if you could give me any help regarding this subject!