kakaoenterprise / JORLDY

Repository for Open Source Reinforcement Learning Framework JORLDY
Apache License 2.0
362 stars 49 forks source link

Improvement performance of IQN #172

Open erinn-lee opened 2 years ago

erinn-lee commented 2 years ago

Please describe the feature you want to add. A clear and concise description of what the feature. Ex. I'm going to implement ...

Improvement performance of IQN

Additional requirement A clear and concise description of additional requirement for the new feature

Reference Please append the reference about the feature

erinn-lee commented 2 years ago

Benchmarks of JORLDY agents https://www.notion.so/Benchmark-09684f1adf764c84a5a331cb5690544f

Models with IQN networks have poor performance. [ I ] Agents which series of IQN have lower performances than other Distributional RL agents. [ II ] The n-step option tends to destabilize the performance of the Rainbow IQN

"Agents which series of IQN have lower performances than other Distributional RL agents"

스크린샷 2022-05-31 오후 2 57 06 스크린샷 2022-05-31 오후 3 01 04

IQN agent has lower or same performance comparing with C51 and QR-DQN. Please, refer the link on top. M-IQN which applied 'Muchausen' RL technique also has lower or same performance too. Their performances should be enhanced.

"The n-step option tends to destabilize the performance of the Rainbow IQN"

스크린샷 2022-05-31 오후 3 02 04

The performance of Rainbow IQN is unstable. Especially, it is vulnerable about Breakout task when the agent update by using n-step TD error. Their performances should be enhanced.