abhisheknaik96 / differential-value-iteration

Experiments in creating the ultimate average-reward planning algorithm
Apache License 2.0
0 stars 2 forks source link

Several fixes and cleanups #42

Closed btanner closed 2 years ago

btanner commented 2 years ago
btanner commented 2 years ago

I'm going to commit this and the following PR and then continue working.