hpi-sam / rl-4-self-repair

Reinforcement Learning Models for Online Learning of Self-Repair and Self-Optimization
MIT License
0 stars 1 forks source link

Implement SARSA(lambda) with Eligibility Traces #10

Closed christianadriano closed 4 years ago

christianadriano commented 4 years ago

Implement the n-step Sarsa(lambda) with Eligibility Traces Evaluate it against the current environment with different data transformations What is the optimal parameterization of the algorithm for different number of concurrent failures?

2start commented 4 years ago

Implemented