Closed alexnikulkov closed 1 year ago
This pull request was exported from Phabricator. Differential Revision: D41226450
Base: 87.63% // Head: 87.69% // Increases project coverage by +0.05%
:tada:
Coverage data is based on head (
c5942b7
) compared to base (ff1ff09
). Patch coverage: 96.62% of modified lines in pull request are covered.
:umbrella: View full report at Codecov.
:loudspeaker: Do you have feedback about the report comment? Let us know in this issue.
This pull request was exported from Phabricator. Differential Revision: D41226450
This pull request has been merged in facebookresearch/ReAgent@25bafe6e3ad4ecf12bc6ab128d31ab140aa8febc.
Summary: Add Offline Evaluation for non-stationary Contextual Bandit policies. This diff includes only the Policy Evaluator algorithms from the LinUCB paper: https://arxiv.org/pdf/1003.0146.pdf (Algorithm 3)
Differential Revision: D41226450