Closed sidorov-ks closed 7 years ago
Added bare-bones version of optimailty tightening for Q-learning (as described in arXiv:1611.01606)
Seems great! Could you please make it follow the same pattern of inputs as the basic qlearning? (to make transition easier)
Added bare-bones version of optimailty tightening for Q-learning (as described in arXiv:1611.01606)