Open GoogleCodeExporter opened 9 years ago
Hi. I think the problem is in my implementation of OptimisticValueIteration. To make it stable for gamma = 1, it needs to check some alternative convergence condition. There are also a couple of other ways to improve it; for example, getting the tightest possible bound on the transitions normally requires solving a linear program.
If you can help fix it, that would be nice.
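For illustration, one commonly used alternative stopping rule for undiscounted value iteration is to stop when the span (max minus min) of the change between successive value vectors is small, instead of waiting for the values themselves to stop changing (with gamma = 1 they usually keep growing). The sketch below is an assumption about what such a check could look like, not the project's OptimisticValueIteration: the function name `value_iteration_span`, the arrays `P` and `R`, and the `tol` parameter are all hypothetical, and it works on a plain known MDP without the optimistic step over transition bounds.

```python
import numpy as np


def span(v):
    """Span seminorm: difference between the largest and smallest entry."""
    return float(np.max(v) - np.min(v))


def value_iteration_span(P, R, gamma=1.0, tol=1e-6, max_iter=10_000):
    """Value iteration with a span-based stopping rule (hypothetical sketch).

    P: array of shape (S, A, S), P[s, a, s2] = transition probability.
    R: array of shape (S, A), expected immediate reward.
    Returns (relative values, greedy policy, iterations used).
    """
    n_states = P.shape[0]
    v = np.zeros(n_states)
    for it in range(1, max_iter + 1):
        q = R + gamma * (P @ v)      # shape (S, A)
        v_new = q.max(axis=1)
        # For gamma = 1 the values drift by a constant each sweep, so the
        # sup-norm of (v_new - v) need not shrink; its span can still do so.
        if span(v_new - v) < tol:
            return v_new - v_new.min(), q.argmax(axis=1), it
        # Subtract a constant to keep the numbers bounded; a constant shift
        # does not change the span test or the greedy policy.
        v = v_new - v_new.min()
    raise RuntimeError("span criterion not met within max_iter sweeps")


if __name__ == "__main__":
    # Tiny made-up 2-state, 2-action MDP; state 1 is the only rewarding one.
    P = np.array([[[0.9, 0.1], [0.5, 0.5]],
                  [[0.5, 0.5], [0.1, 0.9]]])
    R = np.array([[0.0, 0.0],
                  [1.0, 1.0]])
    h, policy, iters = value_iteration_span(P, R)
    print(h, policy, iters)
```

This kind of check is only expected to terminate under suitable conditions on the MDP (for example, aperiodic transitions), so the `max_iter` guard is kept as a fallback.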
Original issue reported on code.google.com by Ian.Osb...@gmail.com on 12 May 2013 at 6:16