rail-berkeley / rlkit

Collection of reinforcement learning algorithms
MIT License
2.45k stars 550 forks source link

Understanding the SAC paramters #89

Closed nanbaima closed 4 years ago

nanbaima commented 4 years ago

Hi Vitchyr,

If I may, I would kindly like to ask you an question related to: https://github.com/vitchyr/rlkit/blob/1d469a509b797ca04a39b8734c1816ca7d108fc8/examples/sac.py#L92-L96

About:

I did read the SAC paper, however didn't find any specificities such as, num_eval_steps_per_epoch, or min_num_steps_before_training, so I got a little lost.

I would kindly appreciate if you could give me any help regarding this subject!

nanbaima commented 4 years ago

By trial and error I could observe, by felling not scientific :), few things in my implementation:

I could observe that it has to be like the num train per loop >>> n_exploração (exploration should be much higher than train per loop)

Here are the hyper parameters batch_rl_algorithm.py

I hope I can help anyone, and I could make any thing clear...

vitchyr commented 4 years ago

Yup! And to clarify, the num_trains_per_train_loop is per train loop, but typically the number of train loops per epoch is 1, so this is effectively the same.