eleurent / rl-agents

Implementations of Reinforcement Learning and Planning algorithms
MIT License
591 stars 153 forks source link

Fix olop mistake #25

Closed p-shg closed 4 years ago

p-shg commented 5 years ago

typo to use the config file to compute the UCB using KL

eleurent commented 5 years ago

Right, thanks! but you also included the safe deep copy work for SOFA, which is unrelated