eloialonso / iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
https://openreview.net/forum?id=vhFu1Acb0xb
GNU General Public License v3.0
791 stars 77 forks source link

Regarding the game-specific detailed configurations #25

Closed rzbsys closed 4 months ago

rzbsys commented 5 months ago

Hi there,

Really enjoyed reading your paper as well as the codes. Thanks!

I was wondering if I can get the detailed configurations for each game in the Atari environement. For example, we noticed that in the atari environment, we can manually set the "frame-skip", and "noop-max" values and etc. In your default configuration, they are set to 4 and 30 respectively, but I was wondering if this value is consistent throughout all the environments in your experimental settings.

Also, did you enable "clip_reward" for all your games (experiments)?

I looked extensively in your paper's supplementary part (appendices) as well as the code, but some of the details to replicate the experimental results are missing it seems.

If there is any other literature (paper) that has disclosed such configurations, I would appreciate if you could share such resource as well.

Thank you!

vmicheli commented 4 months ago

Hey,

Thanks for the kind words!

We use the default configuration for the whole benchmark.

We classify rewards as -1/0/+1 but we do not use an environment wrapper for that. See https://github.com/eloialonso/iris/blob/ac6be401fed2b6176c9ce0cf1dc10e376c9d740d/src/models/world_model.py#L120

Hope that helps.