rdgreene / sa_tsp

Q-Learning applied to the classic Travelling Salesman Problem
18 stars 3 forks source link

Implement 'n-step' Temporal Difference #21

Open miguelesteras opened 7 years ago