krishnaph23 / Smart-Traffic-Light-Controller-using-Deep-Reinforcement-Learning


Reward #2

Closed. And-93 closed this issue 2 years ago

And-93 commented 2 years ago

How is the reward calculated? Could you tell me what data the reward takes into account?

krishnaph23 commented 2 years ago

https://github.com/krishnaph23/Smart-Traffic-Light-Controller-using-Deep-Reinforcement-Learning/blob/34701cdb8dcdbf812bce806d63d1287fedce1014/ANN/3-way/training_simulation.py#L77

The reward is the difference between the current wait time and the previous wait time. Essentially, the lower the wait time, the higher the reward.
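A minimal sketch of that idea, assuming the sign convention is previous total wait minus current total wait (the ordering under which a shorter wait gives a larger reward). The names `total_wait_time` and `compute_reward` and the sample vehicle IDs are hypothetical and are not taken from `training_simulation.py`; the linked line is the reference for the actual code.

```python
from typing import Dict


def total_wait_time(wait_per_vehicle: Dict[str, float]) -> float:
    """Sum the accumulated waiting time (in seconds) over all tracked vehicles."""
    return sum(wait_per_vehicle.values())


def compute_reward(previous_total_wait: float, current_total_wait: float) -> float:
    """Assumed convention: positive when the total wait dropped since the last step."""
    return previous_total_wait - current_total_wait


# Hypothetical per-vehicle waiting times at two consecutive decision steps.
previous = total_wait_time({"veh_0": 12.0, "veh_1": 8.0})
current = total_wait_time({"veh_0": 5.0, "veh_1": 3.0, "veh_2": 2.0})

reward = compute_reward(previous, current)
print(reward)  # positive, because the total wait went down
```

Under this convention, an action that lets queues grow produces a negative reward, so the agent is pushed toward light phases that reduce waiting overall.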

And-93 commented 2 years ago

Thank you. By chance, is there a thesis or document that describes this work in more detail that I could ask for?
