Optimise RL

theomarzaki / TrafficOrchestrator

Traffic Orchestrator for central-unit processing of autonomous vehicle merging through the use of Reinforcement Learning

MIT License


Closed: theomarzaki closed this issue 5 years ago

theomarzaki commented 5 years ago

Log:

Changed reward allocation (part of the research)
Removed RFC as a reward aid, which gave much faster training times
Changed IsTerminal for Model Learning to account for extreme agent actions that resulted in (±inf, nan) values
Saved the rewards and loss over time for the models in a text file for further extrapolation
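
For anyone following along, here is a rough sketch of what the last two log entries could look like. It is not the code from this repo: `is_terminal`, `append_metrics`, the `reached_goal` flag and the `episode,reward,loss` line format are all hypothetical names and choices made up for illustration. The idea is to end the episode as soon as any state value goes to inf, -inf or nan, and to append one line of metrics per episode to a plain text file.

```python
import math
from pathlib import Path


def is_terminal(state, reached_goal=False):
    """Return True when the episode should end.

    Hypothetical stand-in for the project's IsTerminal check: the episode
    ends either because the goal was reached or because any state value
    has become inf, -inf or nan after an extreme action.
    """
    if reached_goal:
        return True
    return any(not math.isfinite(value) for value in state)


def append_metrics(path, episode, reward, loss):
    """Append one 'episode,reward,loss' line to a plain-text log.

    The comma-separated layout is an assumption, chosen so the file can
    be loaded later for plotting or curve fitting.
    """
    with Path(path).open("a", encoding="utf-8") as handle:
        handle.write(f"{episode},{reward},{loss}\n")
```

Treating a non-finite state as terminal keeps inf and nan values from propagating into later updates, and appending per episode means the history survives if training is interrupted.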