hpi-sam / rl-4-self-repair

Reinforcement Learning Models for Online Learning of Self-Repair and Self-Optimization
MIT License
0 stars 1 forks source link

Evaluate Q-Learning with Eligility Traces #11

Closed christianadriano closed 4 years ago

christianadriano commented 4 years ago

So it is comparable with the Sarsa implementation. Evaluate it for different parameterizations of the algorithm, different number of component failures, and different environments (data transformations).

2start commented 4 years ago

Implemented.