Open mchivuku opened 1 year ago
Hello! We've published a big update that provides more reasonable implementations of these intrinsic rewrads.
Now you may get the same performance of the ICM.
If you have any other questions, please don't hesitate to ask here.
@mchivuku
Thank you. I will work on taking the update and applying it on my problem.
Hello,
I used the example code provided: https://github.com/yuanmingqi/rl-exploration-baselines/blob/main/examples/ppo_re3_bullet.py to compute intrinsic rewards using ICM module. I found the results were different from what I have got from running mlagents icm module. This is against the custom unity game wrapped in gym wrapper. Can you please let me know what could be different that I must be missing? Thank you so much for the help. I can also share the code.