peteflorence / Machine-Learning-6.867-homework

53 stars 34 forks source link

SARSA #32

Open manuelli opened 8 years ago

manuelli commented 8 years ago
manuelli commented 8 years ago

Currently the discrete version of this is working. 4 inner and 4 outer bins seems to work reasonably well. Have implemented the continuous version with function approximation. Now need to test it our for convergence.