sawcordwell / pymdptoolbox

Markov Decision Process (MDP) Toolbox for Python
BSD 3-Clause "New" or "Revised" License
518 stars 252 forks source link

Improper Assertion Statement. #29

Open ryanpeach opened 5 years ago

ryanpeach commented 5 years ago
assert self.max_iter >= 10000, "'n_iter' should be greater than 10000."

This line in QLearning is inconvenient and arbitrary. It should be either removed or turned into a warning.

I am interested in hacking these classes, for instance, such that they can iteratively learn as the Reward matrix changes. As such, small values of this value are necessary.