lauramsmith / fine-tuning-locomotion

Apache License 2.0
114 stars 23 forks source link

samples_per_iter=512? #3

Closed swami1995 closed 2 years ago

swami1995 commented 2 years ago

Hey,

Thanks for making the repo public! I was trying to run some of the experiments from the paper and noticed that the training seems pretty slow. Upon some digging I saw that the default hyperparameters for "samples_per_iter" has been set to 512 (which means that the algo is collecting 512 samples before performing 1 SAC/REDQ update). From my limited experience with SAC, I've mostly seen that hyperparameter set to 1 so just wanted to confirm if there's a reason why it's been set to 512 in this case?

Thanks!

swami1995 commented 2 years ago

Sorry, I just realized that each iter also involves 512 critic and policy updates. my bad.