eric-mitchell / macaw

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]