Value loss explodes in Atari Venture for both PPO and RND

seungjaeryanlee / agents

TF-Agents is a library for Reinforcement Learning in TensorFlow

Apache License 2.0

1 stars 0 forks source link

Value loss explodes in Atari Venture for both PPO and RND #15

Open seungjaeryanlee opened 5 years ago

seungjaeryanlee commented 5 years ago

value_loss_explosion

As shown in the figure above, the value estimation loss explodes. This is shown both in PPO (shown above) and RND, so I am assuming that it might be due to how I am handling Atari environment.