gabriquaranta / computational-intelligence


Lab 10 review by Giorgio Cacopardi (s309685) #4

Open GioC1810 opened 6 months ago

GioC1810 commented 6 months ago

Hi Gabriele, first of all, well done: your code was very clear to me. I really like that you trained the Q-agent not only against a random player but also against a minimax agent; this should make the Q-table more accurate, since the agent visits more of the states that matter when choosing good moves. However, I have some advice that I think can make your code even better, mainly regarding the balance between exploration and exploitation:
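
As a generic illustration of that trade-off (a minimal sketch, not taken from the reviewed code): a common approach in tabular Q-learning is an epsilon-greedy policy whose epsilon decays over the training episodes, so the agent explores a lot early on and exploits more later. The names `decayed_epsilon` and `choose_action`, the dict-based Q-table keyed by `(state, action)`, and all default values are assumptions made for the example.

```python
import random


def decayed_epsilon(episode, eps_start=1.0, eps_min=0.05, decay=0.995):
    """Exponentially decay epsilon per episode, never going below eps_min.

    All default values are illustrative assumptions, not tuned settings.
    """
    return max(eps_min, eps_start * decay ** episode)


def choose_action(q_table, state, actions, epsilon, rng=random):
    """Epsilon-greedy choice over a hypothetical dict-based Q-table.

    With probability epsilon pick a random action (exploration);
    otherwise pick the action with the highest known Q-value (exploitation).
    """
    if rng.random() < epsilon:
        return rng.choice(actions)
    # (state, action) pairs that were never visited default to 0.0.
    return max(actions, key=lambda a: q_table.get((state, a), 0.0))
```

In a training loop, one would call `decayed_epsilon(episode)` at the start of each episode and pass the result to `choose_action`; setting epsilon to 0 at evaluation time makes the agent purely greedy.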

I hope my suggestions are helpful to you and best of luck.