HumanCompatibleAI / evaluating-rewards

Library to compare and evaluate reward functions
https://arxiv.org/abs/2006.13900
Apache License 2.0
61 stars 7 forks source link

Divergence of gridworld rewards and reward heatmap improvements #11

Closed AdamGleave closed 4 years ago

AdamGleave commented 4 years ago

New features:

TODO:

codecov[bot] commented 4 years ago

Codecov Report

Merging #11 into master will increase coverage by 2.75%. The diff coverage is 93.62%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master      #11      +/-   ##
==========================================
+ Coverage   81.42%   84.17%   +2.75%     
==========================================
  Files          43       45       +2     
  Lines        2745     2799      +54     
==========================================
+ Hits         2235     2356     +121     
+ Misses        510      443      -67
Impacted Files Coverage Δ
src/evaluating_rewards/analysis/stylesheets.py 71.42% <ø> (ø) :arrow_up:
tests/test_scripts.py 100% <100%> (ø) :arrow_up:
src/evaluating_rewards/analysis/visualize.py 83.68% <100%> (+0.44%) :arrow_up:
...c/evaluating_rewards/analysis/gridworld_rewards.py 100% <100%> (ø)
...uating_rewards/analysis/plot_divergence_heatmap.py 69.89% <100%> (-0.95%) :arrow_down:
src/evaluating_rewards/tabular.py 49.48% <84.84%> (+49.48%) :arrow_up:
...c/evaluating_rewards/analysis/gridworld_heatmap.py 96.12% <90.9%> (-1.93%) :arrow_down:
...ting_rewards/analysis/plot_gridworld_divergence.py 94.52% <94.52%> (ø)
...aluating_rewards/analysis/plot_gridworld_reward.py 97.72% <97.72%> (ø)
... and 3 more

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 6bdeec5...88afaee. Read the comment docs.