LucasAlegre / morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.
https://lucasalegre.github.io/morl-baselines
MIT License
295 stars 47 forks source link

Refactor Multi-Policy MO-Qlearning #37

Closed LucasAlegre closed 1 year ago

LucasAlegre commented 1 year ago

Refactor MPMOQLearning such that is can use OLS or GPI-LS inside train() method.