Closed kivo360 closed 5 years ago
I don’t think I can help you with that. Don’t know of any user groups or forums dedicated to it. Perhaps you can try reddit.
Wow, that was a fast response. I'm reading up more on contextual and multi-arm bandits. If I can formulate a good question, would you be open to helping me answer? It's in regards to online learning. I feel that after reading up for a few hours I'll be able to ask a proper question.
Never mind, I was able to figure something out.
I'm extremely new to the subject of contextual bandits and reinforcement learning. What I am interested in is how people use contextual bandits for advertising at scale. How do you average the rewards out?