Closed nanli42 closed 4 years ago
Hi @nanli42, Thanks for your contribution. I agree with your opinion. But, I'm busy due to other projects. I will test it and tell you. Thank you for your interest.
Gilbert.
@nanli42
Thank you for your comments. We will take them into consideration when we release the next update (ROS 2 Dashing Diademata)
Thank you very much, Ryan
Hello,
Firstly thanks for your excellent example of DQN on TurtleBot3!
But it seems that there are some problems about the training batch setting in function def trainModel() from "_turtlebot3_machine_learning/turtlebot3_dqn/nodes/turtlebot3_dqnstage*_":
If some action is done, it means that it reaches the goal or the obstacle, or it get the TIMEOUT. Thus it seems useless to save the next state. The rewards have been distinguished regard to done or not in the function def getQvalue(). So I think it is better to remove the code if dones:.... Otherwise, a consequence could be: when we fit the model in
the size of X_batch and Y_batch could exceed the batch_size.
Maybe I am wrong, then please let me know! Thanks in advance! :) Nan