rail-berkeley / rlkit

Collection of reinforcement learning algorithms
MIT License
2.51k stars 554 forks source link

IQL reimplementation and examples. #150

Closed anair13 closed 3 years ago

anair13 commented 3 years ago

IQL (https://arxiv.org/abs/2110.06169) reimplementation and examples. Also changed batch_rl_algorithm to be offline RL for negative epochs.