Good day,
I am experiencing add issue while testing a simple custom environment with default DQN agent.
I have state space PlanarMapsObservationSpace(shape=np.array([50, 50, 1]), low=0, high=1) and discrete action space with 4 actions (move up/down/left/right). At one random point on the map is spawned "target" (value 1) and elsewhere is "nothing" (value 0). The goal of the agent is to move the target to the middle of the map. A reward is a negative manhattan distance between the target and middle of the map.
The issue is that from the first evaluation phase agent chose one direction and blindly move the target in this direction.
Good day, I am experiencing add issue while testing a simple custom environment with default DQN agent. I have state space
PlanarMapsObservationSpace(shape=np.array([50, 50, 1]), low=0, high=1)
and discrete action space with 4 actions (move up/down/left/right). At one random point on the map is spawned "target" (value 1) and elsewhere is "nothing" (value 0). The goal of the agent is to move the target to the middle of the map. A reward is a negative manhattan distance between the target and middle of the map. The issue is that from the first evaluation phase agent chose one direction and blindly move the target in this direction.My code here: Environment:
Take_action:
Update_state:
Restart_environment:
Preset:
If anyone has any idea how to solve this please reply. Thanks in advance