zt95 / infinite-horizon-off-policy-estimation

infinite-horizon-off-policy-estimation

This repository contains an implementation of the following paper: Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation

Citation

If you find this article useful, please consider citing:

    @article{liu2018breaking,
      title={Breaking the curse of horizon: Infinite-horizon off-policy estimation},
      author={Liu, Qiang and Li, Lihong and Tang, Ziyang and Zhou, Dengyong},
      journal={arXiv preprint arXiv:1810.12429},
      year={2018}
    }