Open eakirtas opened 3 years ago
@ManosMagnus I have fixed the issue that robot stop moving after the first episode by adding two lines of code. However, it seems weird that there are several additional resets. Did you design this for some purposes?
Currently, it seems to be able to converge.
Link to My fork
However, the robot doesn't reach the target when using agent.choose_action_test()
instead of agent.choose_action_train()
after the training is done.
Still trying to figure out how to fix it.:thinking:
Currently, agent of find and avoid seems that it is not converge