I have been training it for about 4000 episodes(about 7 hours on one NVIDIA GeForce GTX1060 GPU), but the performance is still not good.I don't know how to observe the training process such as loss with the help of tensorboard. Could you please give me some advice?
I have been training it for about 4000 episodes(about 7 hours on one NVIDIA GeForce GTX1060 GPU), but the performance is still not good.I don't know how to observe the training process such as loss with the help of tensorboard. Could you please give me some advice?
merged = tf.summary.merge_all() sess.run(merged,feed_dict={})
1.Where should I run merged? 2.What is the content of the feed_dict?
I believe your paper and code are both excellent.What I am worried about is whether rl algorithms can work as well as the paper.