tristandeleu / pytorch-maml-rl

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
MIT License
827 stars 158 forks source link

hyperparameters for multi-armed bandit envs #23

Open VashishtMadhavan opened 6 years ago

VashishtMadhavan commented 6 years ago

do you happen to have the hyperparameters for the multi-armed bandit experiments. im trying to compare with the results of Duan et al. 2016