plot rewards as a function of number of timesteps

dgriff777 / rl_a3c_pytorch

A3C LSTM Atari with Pytorch plus A3G design

Apache License 2.0

562 stars 119 forks source link

plot rewards as a function of number of timesteps #30

Open rachit-dubey opened 6 years ago

rachit-dubey commented 6 years ago

Hi, thanks so much for the excellent codebase. Just wondering, is there any way to plot the training curve as a function of timesteps (as opposed to plotting the training curve as a function of time passed)?

Thanks!

dgriff777 commented 6 years ago

As the training cases are in asynchronous manner. If you need such data and want accurate number and not hinder training performance then probably best to keep a tally in each parallel process. Then set up to combine total by sending that data to a newly created process that would do the actual summation of.