amathislab / lattice

[NeurIPS 2023] Latent Exploration for Reinforcement Learning
MIT License
27 stars 2 forks source link

Question on reproducing results in the elbow pose task #1

Closed Beanpow closed 6 months ago

Beanpow commented 7 months ago

Hi, thanks for sharing your code. Recently, I am trying to reproduce your result in your paper. But I got different results. Could you give me some insights?

I didn't modify any code, just ran the following commands

and here is the result I got.

image

So, my question is:

  1. Some commands in your Readme don't open the --use_sde flag, so should I open the flag if I want to reproduce the result in your paper?
  2. The default args are different from your paper, which args I should use? such as the init_log_std in main_pose_elbow.py.
albertochiappa commented 7 months ago

Hi, thanks for trying out our code!

In the supplementary material of the paper you can find the detail of the hyperparameters used for every environment, you can take them from there. As you have also tested, Lattice with gSDE required higher initial standard deviation to work well in the elbow pose environment. We used multiple environments in parallel to speed up the training, you might want to try that.

Lattice without SDE was added after the publication of the paper, so we have not benchmarked it in the elbow pose environment. However, we have used for our solution of the MyoChallenge 2023. From your learning curves it seems like it's solving the task correctly!