FazelYU / Adaptive-Navigation

invalid actions of the agent #2

Closed FazelYU closed 2 years ago

FazelYU commented 3 years ago

A router may route to a road that is not connected to the current road.

FazelYU commented 3 years ago

subs_edge=random.choice(traci.lane.getLinks(road_ID+"0"))[0].split()[0]
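
A minimal sketch of the same idea, written as a helper that can be run without a live simulation. It assumes that each entry returned by `traci.lane.getLinks()` is a tuple whose first element is the successor lane ID, and that SUMO lane IDs have the form `<edgeID>_<laneIndex>`; if so, the underscores in the line above may have been lost when the comment was rendered. The helper name `random_successor_edge` is hypothetical.

```python
import random


def random_successor_edge(links, rng=random):
    """Pick a random successor edge from the output of traci.lane.getLinks().

    `links` is assumed to be a list of tuples whose first element is the
    successor lane ID, e.g. "edgeB_0".
    """
    if not links:
        return None  # dead end: this lane has no outgoing connection
    lane_id = rng.choice(links)[0]
    # Strip the trailing "_<laneIndex>" to recover the edge ID.
    # rsplit keeps edge IDs that themselves contain underscores intact.
    return lane_id.rsplit("_", 1)[0]


# Hypothetical usage inside a running TraCI session (assumes lane 0 exists):
# subs_edge = random_successor_edge(traci.lane.getLinks(road_ID + "_0"))
```

Returning `None` for an empty link list makes the dead-end case explicit instead of letting `random.choice` raise an `IndexError`.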

FazelYU commented 3 years ago

Options for the next state after an invalid action: the current state / a randomly chosen next state / the invalid chosen next state. Done = false.

FazelYU commented 3 years ago

We can allow the U-turn for now

FazelYU commented 3 years ago

Next state = the randomly chosen next state
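
A rough sketch of that fallback, assuming the environment already knows which edges are reachable from the current one. The function name `resolve_next_edge` and the `valid_edges` argument are hypothetical and not taken from the repository.

```python
import random


def resolve_next_edge(chosen_edge, valid_edges, rng=random):
    """Return (next_edge, was_invalid) for the edge the agent chose.

    If the chosen edge is not connected, a random connected edge is used
    instead. The caller keeps done = False in both cases.
    """
    valid_edges = list(valid_edges)
    if not valid_edges:
        raise ValueError("no connected edge to fall back to")
    if chosen_edge in valid_edges:
        return chosen_edge, False
    # Invalid action: substitute a randomly chosen connected edge.
    return rng.choice(valid_edges), True
```

The `was_invalid` flag lets the caller decide separately how to treat the transition, for example whether to penalise it.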

FazelYU commented 2 years ago

https://ai.stackexchange.com/questions/2980/how-should-i-handle-invalid-actions-when-using-reinforce

FazelYU commented 2 years ago

improvement: options:

FazelYU commented 2 years ago

Done: discard the invalid actions in the first place. Mask the output layer of the Q-Network over the valid actions only.
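
A minimal sketch of masking at action-selection time, assuming the Q-network outputs one value per candidate action and that a boolean mask of valid actions is available for the current state. The function names and the epsilon-greedy wrapper are illustrative assumptions, not the repository's code.

```python
import math
import random


def masked_greedy_action(q_values, valid_mask):
    """Return the index of the highest-valued action among the valid ones."""
    if not any(valid_mask):
        raise ValueError("no valid action in this state")
    # Invalid actions get -inf so they can never win the argmax.
    masked = [q if ok else -math.inf for q, ok in zip(q_values, valid_mask)]
    return max(range(len(masked)), key=masked.__getitem__)


def masked_epsilon_greedy(q_values, valid_mask, epsilon, rng=random):
    """Epsilon-greedy selection that explores only over valid actions."""
    if rng.random() < epsilon:
        valid_indices = [i for i, ok in enumerate(valid_mask) if ok]
        if not valid_indices:
            raise ValueError("no valid action in this state")
        return rng.choice(valid_indices)
    return masked_greedy_action(q_values, valid_mask)


# Action 0 has the largest Q-value but is masked out, so action 2 is chosen.
action = masked_greedy_action([5.0, 1.0, 3.0], [False, True, True])
```

If masking is used when selecting actions, the same mask would presumably also be applied to the maximum over next-state Q-values when computing the training target, so that invalid actions do not leak into the bootstrap estimate.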