vlad17 / mve

MVE: model-based value estimation
Apache License 2.0
10 stars 0 forks source link

rm reward scaling #358

Closed vlad17 closed 6 years ago

vlad17 commented 6 years ago

it's cheating and should be replaced with algorithm-specific hyperparameters -- eg temperature