Open asliyigit2 opened 1 week ago
Thank you for your attention.
Figure 5 plots the mean and variance of the moving-average reward. In other words, if you train with five random seeds and collect the five moving-average reward curves (the orange curve in your training reward plot), you can compute their mean and variance at each episode, which is what Fig. 5 shows.
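For anyone who wants to reproduce this kind of plot, here is a minimal sketch of the procedure described above. It is not the repository's code: the helper names (`moving_average`, `mean_and_variance`, `plot_band`, `fake_run`), the window size of 10, and the synthetic reward data are all assumptions for illustration. Whether the shaded band in the paper is the variance or the standard deviation is also an assumption; this sketch shades one standard deviation around the mean.

```python
import random
import statistics


def moving_average(rewards, window=10):
    """Trailing moving average; the first entries average over fewer points.

    The window size is an assumption, not taken from the repository.
    """
    out = []
    running = 0.0
    for i, r in enumerate(rewards):
        running += r
        if i >= window:
            running -= rewards[i - window]
        out.append(running / min(i + 1, window))
    return out


def mean_and_variance(curves):
    """Per-episode mean and population variance across the seed curves."""
    per_episode = list(zip(*curves))  # one tuple of values per episode
    means = [statistics.fmean(v) for v in per_episode]
    variances = [statistics.pvariance(v) for v in per_episode]
    return means, variances


def plot_band(means, variances, label="agent"):
    """Plot the mean curve with a shaded band of one standard deviation."""
    import matplotlib.pyplot as plt  # imported here so the math runs without it

    episodes = list(range(len(means)))
    std = [v ** 0.5 for v in variances]
    lower = [m - s for m, s in zip(means, std)]
    upper = [m + s for m, s in zip(means, std)]
    plt.plot(episodes, means, label=label)
    plt.fill_between(episodes, lower, upper, alpha=0.3)
    plt.xlabel("Episode")
    plt.ylabel("Moving average reward")
    plt.legend()
    plt.show()


def fake_run(seed, episodes=500):
    """Hypothetical stand-in for one training run; replace with real rewards."""
    rng = random.Random(seed)
    return [i / episodes + rng.gauss(0, 0.1) for i in range(episodes)]


# Five seeds, one moving-average curve per seed, then statistics across seeds.
curves = [moving_average(fake_run(seed)) for seed in range(5)]
means, variances = mean_and_variance(curves)
```

To use it with real results, replace `fake_run(seed)` with the per-episode reward list logged from each seeded training run, then call `plot_band(means, variances)`. All five runs need the same number of episodes so the curves line up.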
Thank you for your reply.
Could you share the source code?
Thank you for your work.
I reviewed your code and trained it for 500 episodes.
However, when I examined the code, I could not find how you drew the convergence curve comparison in Figure 5 of the article.
I share my chart below. I would appreciate your guidance on how to draw the graph shown in Figure 5.