Understanding tensorboard outputs of rl-coach model

IntelLabs / coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

Apache License 2.0

2.32k stars 460 forks source link

Hi all,

Related the issue that was closed some days ago ( #374 ), the layers output is somewhat confusing when printing the graph. This also extrapolates to the outputs seen on tensorboard when saving the logs via VisualizationParameters param of tensorboard.

I think that having some guidance in the docs or in the naming of the different layers/components will be crucial to understand and interprete if the agents are learning correctly and if there could be some tweaking that could help them learn better.

¿Any insights about this @galnov @gal-leibovich and etc?

As always, I'm keen on helping as much as I can once I know more (rewriting docs or whatever).

Thanks in advance,

IntelLabs / coach

Understanding tensorboard outputs of rl-coach model #381