Closed by FazelYU 2 years ago
subs_edge=random.choice(traci.lane.getLinks(road_ID+"0"))[0].split()[0]
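A minimal sketch of the same successor-edge pick, assuming `links` has the shape returned by `traci.lane.getLinks(laneID)` (a list of tuples whose first element is the successor lane ID) and assuming the usual SUMO lane naming `<edgeID>_<laneIndex>`. The helper name `random_successor_edge` is hypothetical. Note that `str.split()` with no argument splits on whitespace, so it would return the lane ID unchanged; the sketch strips the lane index with `rsplit("_", 1)` instead.

```python
import random


def random_successor_edge(links, rng=random):
    """Pick a random outgoing link and return the edge ID of its target lane.

    `links` is assumed to look like the result of traci.lane.getLinks(laneID):
    a list of tuples whose first element is the successor lane ID.
    """
    if not links:
        return None  # dead end: this lane has no outgoing connection
    next_lane = rng.choice(links)[0]
    # Assumed SUMO convention: lane ID = "<edgeID>_<laneIndex>".
    # rsplit keeps any underscores that belong to the edge ID itself.
    return next_lane.rsplit("_", 1)[0]
```

Returning `None` for an empty link list makes the dead-end case explicit instead of letting `random.choice` raise on an empty sequence.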
[x] The agent must be aware of the incoming edge of the AV that sends the routing query
the state dimension would be different for different agents
Next state = current state / the randomly chosen next state / the chosen invalid next state; Done = false
We can allow the U-turn for now
Next state = the randomly chosen next state
improvement: options:
Done: discard the invalid actions in the first place. Mask the output layer of the Q-Network over the valid actions only.
a router may route to a road with no connection
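Masking the Q-Network output over the valid actions, so that a router never picks a road with no connection, could look roughly like the sketch below. It is framework-agnostic and works on a plain list of Q-values; the function name `masked_greedy_action` and the representation of valid actions as a list of output indices are assumptions, not taken from the project code.

```python
import math


def masked_greedy_action(q_values, valid_actions):
    """Return the index of the highest Q-value among valid actions only.

    q_values:      one Q-value per output unit of the network
    valid_actions: indices of the outputs that correspond to connected roads
    """
    valid = {a for a in valid_actions if 0 <= a < len(q_values)}
    if not valid:
        raise ValueError("no valid action available")
    # Invalid actions get -inf so they can never win the argmax.
    masked = [q if i in valid else -math.inf for i, q in enumerate(q_values)]
    return max(range(len(masked)), key=masked.__getitem__)
```

For example, `masked_greedy_action([5.0, 1.0, 3.0], [1, 2])` selects action 2 even though action 0 has the largest raw Q-value. The same valid-action set would also need to be used when sampling exploratory actions, otherwise random exploration can still produce an invalid choice.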