thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.
https://tianshou.org
MIT License

Question on benchmark hyperparameters #1220

Open RolandStolz opened 1 week ago

RolandStolz commented 1 week ago

First of all, thank you for the very detailed benchmark descriptions. That is truly amazing! However, I am having trouble finding the specific hyperparameters used for the benchmarks.

Are the hyperparameters used simply the ones given as default arguments (e.g. lines 25-75 in examples/mujoco/mujoco_ppo.py)? If so, that would mean the same hyperparameters were used for all MuJoCo environments, which I find hard to believe. I feel like I'm missing something here.

Any help is greatly appreciated!

Trinkle23897 commented 1 week ago

I believe the benchmarks used the default hyperparameters. You can check out the commit mentioned on the benchmark page to see what was actually run at that time.
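
To illustrate what "default hyperparameters for every environment" means in practice, here is a minimal sketch. It assumes the example script defines its defaults through `argparse`, which may not match the script at the benchmark commit. The `get_args` function, the option names, and all values below are hypothetical placeholders, not the actual benchmark settings. The helper `changed_from_defaults` is also made up for this illustration; it reports which options a given command line overrides.

```python
import argparse


def get_args(argv=None):
    """Hypothetical stand-in for the argument parser of an example script."""
    parser = argparse.ArgumentParser()
    # Placeholder names and values, NOT the real benchmark hyperparameters.
    parser.add_argument("--task", type=str, default="Ant-v4")
    parser.add_argument("--lr", type=float, default=3e-4)
    parser.add_argument("--gamma", type=float, default=0.99)
    return parser.parse_args(argv)


def changed_from_defaults(argv):
    """Return the options whose values differ from the script defaults."""
    defaults = vars(get_args([]))
    actual = vars(get_args(argv))
    return {k: v for k, v in actual.items() if defaults[k] != v}


if __name__ == "__main__":
    # Only the task is overridden; every other option keeps its default.
    print(changed_from_defaults(["--task", "Hopper-v4"]))
```

If the benchmark runs were launched with only the task changed, a comparison like this would show a single overridden entry, which is consistent with the same hyperparameters being shared across environments. Checking the script at the referenced commit is still the only way to confirm the real values.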