hyperparameters for multi-armed bandit envs

tristandeleu / pytorch-maml-rl

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

MIT License

827 stars 158 forks source link

Open VashishtMadhavan opened 6 years ago

VashishtMadhavan commented 6 years ago

do you happen to have the hyperparameters for the multi-armed bandit experiments. im trying to compare with the results of Duan et al. 2016