Bahador-Bakhshi / 5G-Federation


Dyna #17

Open Bahador-Bakhshi opened 3 years ago

Bahador-Bakhshi commented 3 years ago

Dyna needs far fewer episodes to converge than direct Q-Learning.

It seems that, in large problems, using Dyna instead of direct Q-Learning is really beneficial.
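
For reference, here is a minimal sketch of tabular Dyna-Q, to make the comparison concrete. It is not the code from this repository: the chain environment, the function names (`chain_step`, `dyna_q`), and all hyperparameter values are assumptions chosen only for illustration. The idea it shows is that every real transition is also stored in a learned model, and `planning_steps` extra Q-updates are replayed from that model after each real step.

```python
import random
from collections import defaultdict


def chain_step(state, action, n_states):
    """Hypothetical toy environment: a 1-D chain with the goal at the right end.

    action 0 moves left (clamped at state 0), action 1 moves right.
    Reaching the last state gives reward 1 and ends the episode.
    """
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == n_states - 1
    return nxt, (1.0 if done else 0.0), done


def dyna_q(n_states=8, episodes=30, planning_steps=20,
           alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q; planning_steps=0 reduces it to direct Q-Learning."""
    rng = random.Random(seed)
    q = defaultdict(float)   # (state, action) -> estimated value
    model = {}               # (state, action) -> (reward, next_state, done)
    steps_per_episode = []

    def greedy(s):
        # Pick the best action, breaking ties at random.
        best = max(q[(s, 0)], q[(s, 1)])
        return rng.choice([a for a in (0, 1) if q[(s, a)] == best])

    def update(s, a, r, s2, done):
        # One-step Q-Learning backup, used for real and simulated experience.
        target = r if done else r + gamma * max(q[(s2, 0)], q[(s2, 1)])
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        s, done, steps = 0, False, 0
        while not done:
            a = rng.randrange(2) if rng.random() < epsilon else greedy(s)
            s2, r, done = chain_step(s, a, n_states)
            update(s, a, r, s2, done)        # direct RL from the real step
            model[(s, a)] = (r, s2, done)    # deterministic model learning
            for _ in range(planning_steps):  # planning from the model
                ps, pa = rng.choice(list(model))
                pr, ps2, pdone = model[(ps, pa)]
                update(ps, pa, pr, ps2, pdone)
            s = s2
            steps += 1
        steps_per_episode.append(steps)
    return q, steps_per_episode
```

Running `dyna_q(planning_steps=0)` and `dyna_q(planning_steps=20)` with the same seed and comparing the returned `steps_per_episode` lists is one way to check the observation above on a toy problem. The model here assumes deterministic transitions, so a stochastic environment would need a different model representation.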