HumanCompatibleAI / evaluating-rewards

Library to compare and evaluate reward functions
https://arxiv.org/abs/2006.13900
Apache License 2.0
61 stars 7 forks source link

Pathological visitation distributions #48

Closed AdamGleave closed 3 years ago

AdamGleave commented 3 years ago

Add experiments for badly-behaved (or "pathological") visitation distributions to highlight failure modes of EPIC.

Concrete changes:

codecov[bot] commented 3 years ago

Codecov Report

Merging #48 (63477ab) into master (e14bb9e) will decrease coverage by 1.55%. The diff coverage is 75.21%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master      #48      +/-   ##
==========================================
- Coverage   85.14%   83.59%   -1.56%     
==========================================
  Files          65       68       +3     
  Lines        4262     4583     +321     
==========================================
+ Hits         3629     3831     +202     
- Misses        633      752     +119     
Impacted Files Coverage Δ
src/evaluating_rewards/distances/common_config.py 94.73% <ø> (ø)
src/evaluating_rewards/envs/point_mass.py 83.33% <ø> (ø)
src/evaluating_rewards/scripts/distances/common.py 91.85% <ø> (ø)
...ating_rewards/scripts/rewards/train_adversarial.py 0.00% <ø> (ø)
...ating_rewards/scripts/rewards/train_preferences.py 97.87% <ø> (ø)
...valuating_rewards/scripts/rewards/train_regress.py 95.12% <ø> (ø)
src/evaluating_rewards/scripts/train_rl.py 0.00% <0.00%> (ø)
src/evaluating_rewards/analysis/results.py 37.61% <25.00%> (+4.58%) :arrow_up:
src/evaluating_rewards/envs/mujoco.py 88.09% <54.54%> (-10.07%) :arrow_down:
src/evaluating_rewards/datasets.py 59.86% <58.82%> (-10.88%) :arrow_down:
... and 21 more

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update d8256b3...63477ab. Read the comment docs.