kakaoenterprise / JORLDY

Repository for Open Source Reinforcement Learning Framework JORLDY
Apache License 2.0
362 stars 49 forks source link

Performance degradation of ICM, RND #106

Closed leonard-q closed 2 years ago

leonard-q commented 2 years ago

After the PPO and ICM, RND are modified, the performance of ICM and RND degrades. The performance of ICM and RND should be recovered through the change of the techniques and hyperparameters.