rasenqt / computational_intelligence23_24

Collection of assignments for CI course
0 stars 0 forks source link

Lab 4 Review by Borella Simone s317774 #8

Closed SimoneBorella closed 10 months ago

SimoneBorella commented 10 months ago

Hi Michelangelo, nice implementation of the Q-learning algorithm. I liked that you implemented several opponent strategies for testing your trained model, I don't understand why you don't use them also for training! You could reach better performances training your model not only with the random opponent strategy but also with more optimal strategies. Another point is the choice of a constant epsilon value. To encourage exploration a decreasing epsilon is recommended in my opinion to avoid leading into a suboptimal policy, due to the fact that your training agent tends to exploit the actual optimal policy, following the best action-value instead of making some random action for exploration. Anyway good job!

rasenqt commented 10 months ago

Thank you for the review, i was trying just to provide a benchmark suite in order to evaluate my method