Closed vwxyzjn closed 4 years ago
Issue-Label Bot is automatically applying the label bug
to this issue, with a confidence of 0.70. Please mark this comment with :thumbsup: or :thumbsdown: to give our bot feedback!
Links: app homepage, dashboard and code for this bot.
Hi Costa, thanks for the awesome detailed bug report, and solid suggestion on the solution here.
@cvphelps Thanks for the kind words. I am still trying to work on my project on Open RL Benchmark (https://app.wandb.ai/cleanrl/cleanrl.benchmark/reports/benchmark--Vmlldzo0MDcxOA). I hope I will have a major release soon so that I could submit it for potentially being featured in the wandb gallery :)
Great, thanks again. Closing this as a duplicate of this ticket: https://github.com/wandb/client/issues/1093
wandb --version && python --version && uname
wandb, version 0.9.1 Python 3.7.4 LinuxDescription
There are two issues with the current support for gym logging.
1) Inconsistent slider steps for the video panel: The media slider does not allow to use a custom x-axix, resulting in drastic different scale of x-axis for value-based methods vs policy gradient methods. Here are some examples:
The video panel of DQN
The video panel of PPO (PPO does less gradient update and therefore
step
is lowerAs we can see, the slider step for the video panel is drastically different even though both scripts ran for 10M timesteps and there are roughly 17 videos for both scripts.
2) Bugs of video logging with
SubprocVecEnv
SubprocVecEnv
is a common module fromopenai/baselines
(https://github.com/openai/baselines/blob/master/baselines/common/vec_env/subproc_vec_env.py), which allows the use of subprocesses to speed up sampling collection from parallelized gym environments. However, this causes issues for wandb video logging.Not using
SubprocVecEnv
https://app.wandb.ai/cleanrl/cleanRL/runs/gayxplrt/overview?workspace=user-costa-huang
Using
SubprocVecEnv
https://app.wandb.ai/cleanrl/cleanRL/runs/rmqlq8m4/overview?workspace=user-costa-huang When using the
SubprocVecEnv
, not only the steps of the slider becomes incorrect.More importantly, when you have more videos, it does not even allow you to slide over all the videos.What I Did
Proposed solutoin
It seems to me that the most straight forward approach is to allow the use of custom x-axis for the steps of the slider, which is related to issue https://github.com/wandb/client/issues/1093