sauxpa / neural_exploration

Study NeuralUCB and regret analysis for contextual bandit with neural decision
89 stars 24 forks source link