Closed HarukiKozukapenguin closed 2 years ago
hi,
1) the training reward is logged in tensorboard. you can go to saved and run
tensorboard --logdir=./
2) the plots in the first row are position [x, y, z] and the plots in the second row are velocity [x, y, z]
3) the plotting is done here
tensorboard --logdir=./
), or should I run this command before I run python3 -m python.run_vision_ppo --render 0 --train 1?
I moved to the saved/PPO_(num) directory and ran tensorboard --logdir=./
then I could see the transition of each reward.
Thank you!
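For anyone who does not want to look up the run number by hand, here is a minimal sketch that picks the newest run directory and prints the matching TensorBoard command. It assumes the runs are named PPO_1, PPO_2, ... under saved, as described in this thread. The helper name latest_run_dir is hypothetical and is not part of the simulator.

```python
import re
from pathlib import Path


def latest_run_dir(saved_root):
    """Return the PPO_<n> subdirectory with the largest n, or None if there is none."""
    root = Path(saved_root)
    if not root.is_dir():
        return None
    pattern = re.compile(r"^PPO_(\d+)$")
    runs = []
    for child in root.iterdir():
        match = pattern.match(child.name)
        if child.is_dir() and match:
            # Compare run numbers as integers so PPO_10 sorts after PPO_2.
            runs.append((int(match.group(1)), child))
    if not runs:
        return None
    return max(runs, key=lambda item: item[0])[1]


if __name__ == "__main__":
    run = latest_run_dir("saved")
    if run is not None:
        # Only prints the command; run it yourself in a shell.
        print(f"tensorboard --logdir={run}")
```

The run numbers are compared as integers rather than as strings, since a plain alphabetical sort would place PPO_10 before PPO_2.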
@yun-long
I have one question about interpreting TensorBoard. I can see how the rewards change while training in the simulation, but I do not know what they mean.
Hi,
the reward you see on Tensorboard is from here.
In summary, it contains the summed reward and each individual reward component. The reward is a training reward, not an evaluation reward.
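As a rough illustration of how a summed reward relates to its components, here is a small sketch. The component names (progress, collision, command) and their values are made up for the example and are not the simulator's actual reward terms. The helper names are hypothetical as well.

```python
def total_reward(components):
    """Sum the individual reward components into one scalar training reward."""
    return sum(components.values())


def log_entries(components):
    """Return what would be logged: every component plus the summed reward."""
    entries = dict(components)
    entries["total"] = total_reward(components)
    return entries


if __name__ == "__main__":
    # Hypothetical per-step components; penalties are negative.
    step = {"progress": 1.0, "collision": -0.25, "command": -0.5}
    for name, value in log_entries(step).items():
        print(f"{name}: {value}")
```

Read this way, a curve for one component shows how much that term adds to or subtracts from the summed curve over training.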
@yun-long Which code writes these rewards to TensorBoard?
Thank you!
Thank you for the interesting simulator!
I checked
run_vision_ppo.py
with the following command: python3 -m python.run_vision_ppo --render 0 --train 1
I found the data saved during training in the envtest/python/saved
directory (e.g. PPO_1, PPO_2). It contains some policies saved during training (/policy) and test trajectories (/TestTraj). The questions I would like to ask are as follows.