Adds ability to use 64bit transition and reward matrices

abhisheknaik96 / differential-value-iteration

Experiments in creating the ultimate average-reward planning algorithm

Apache License 2.0

0 stars 2 forks source link

Closed btanner closed 3 years ago

btanner commented 3 years ago

Also updates tests to run both ways with different tolerances.

All tolerances tuned to their limit.