abhisheknaik96 / differential-value-iteration

Experiments in creating the ultimate average-reward planning algorithm
Apache License 2.0

Project CheckPointing Merge #52

Closed btanner closed 2 years ago

btanner commented 2 years ago

This set of changes is the first, and a major, step towards moving the in-flight code and experiments into a form suitable for sharing and archiving. Most code that does not contribute to our empirical results has been removed, and comments and tests have been improved, among other cleanups.