Closed maximecb closed 5 years ago
Hi!
Please check out this issue #10. Specifically this answer from @danijar:
Nice! The nan is not a problem at all, it just means that there is no score during iterations where no planning happens. The most important summaries at the "cem" scalar summaries that show the planning performance. If any of the "divergence" scalar summaries is at zero the
divergence_scale
is too high. Besides that, you can look at the "openloop" image summaries to see future frames imagined by the agent and at the "cem" image summaries to see the planning policy in the environment.
You're also welcome to add PR that fixes those warnings 😄 Does it answer you concerns or is there anything more?
Is there an easy way to visualize the progress of training? I saw that there is code in there to generate gifs. Is that called automatically (if so, where do the gifs end up)? Is there a command-line option I should specify?
Hi @maximecb, thanks for reaching out. As @piojanu pointed out, the nans are expected. The GIFs are generated automatically, as long as ffmpeg
is available. They are written as TensorBoard summaries together with all other metrics. Just point a TensorBoard to the log directory.
After a tedious installation process, I just got the training script to run. I am getting some nan values in my console output:
Training command used:
I was just wondering if this is normal, or if there's something wrong with my installation.
PS, I am also getting many identical warnings from numpy, which should be easy to fix. Mentioning this because it really pollutes the console output: