Closed shukla-yash closed 1 year ago
The results presented in the paper were obtained with baselines. To reproduce them strictly, you can use https://github.com/qgallouedec/drl-grasping. However, the code is old and is not maintained anymore. So I strongly advise you to use rl-baselines3-zoo instead. The results are part of openrlbenchmark and are very easy to reproduce. See https://wandb.ai/openrlbenchmark/sb3.
Hi,
I am unable to learn a policy for the PandaPickAndPlace task using RL Zoo. I am trying to get the results shared in the experimental results section of the Panda-gym paper. Here are my hyperparameters for the SAC, DDPG and the TQC algo:
Can you please help me with the hyperparams that you used for your experiments?