hpi-sam / rl-4-self-repair

Reinforcement Learning Models for Online Learning of Self-Repair and Self-Optimization
MIT License

Discount and gain factor #3

Closed brrrachel closed 4 years ago

brrrachel commented 4 years ago

@christianadriano @MrBanhBao To sum up what I understood so far ...

Basic concept: over time, the reward for a given repair action should decrease.
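A rough sketch of that idea, in case it helps the discussion. It assumes geometric discounting (reward multiplied by `discount ** step`), which is only one common way to make a reward shrink over time and is not necessarily what the repo implements. The function name, the parameter names, and the values 10.0 and 0.9 are hypothetical and chosen for illustration.

```python
def discounted_reward(base_reward: float, discount: float, step: int) -> float:
    """Return the reward for a repair action applied at time step `step`.

    Assumption: geometric decay, i.e. base_reward * discount ** step.
    `step` is 0-based, so the first step receives the full base reward.
    """
    if not 0.0 < discount <= 1.0:
        raise ValueError("discount must be in (0, 1]")
    if step < 0:
        raise ValueError("step must be non-negative")
    return base_reward * discount ** step


# Hypothetical values: the same repair action applied at steps 0..3.
rewards = [discounted_reward(10.0, 0.9, t) for t in range(4)]
for t, r in enumerate(rewards):
    print(f"step {t}: reward {r:.2f}")
```

With a discount below 1, each later application of the same action earns strictly less than the one before it, which matches the basic concept above.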

christianadriano commented 4 years ago

Correct. Nothing to add to these definitions.