hongzimao / input_driven_rl_example

Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)
https://people.csail.mit.edu/hongzi/var-website/index.html
MIT License
31 stars 10 forks source link

Input-dependent Baseline

Input-dependent baseline for reducing the variance from external input processes.

Paper: https://openreview.net/forum?id=Hyg1G2AqtQ

Example

Dependencies

Python 3.6, Tensorflow 1.2.1, Numpy 1.14.5