-
Is contextual bandits in the scope of this library? Here is a paper for reference:
https://arxiv.org/pdf/1810.09558.pdf
htcml updated
3 years ago
-
In contextual setting, should be able to replicate estimates, conditional on gammahats: https://github.com/gsbDBI/contextual_bandits_evaluation/tree/main/experiments
-
Hi,
There are several usage questions with Contextual Bandits, that I'd be happy to incorporate in the Wiki and stackoverflow.
2. is progressive validation applied when training with IPS? I'm no…
-
The loss calculation for CB reductions is not consistent and not well documented. The current situation is:
- `cb_adf` records loss as calculated by an IPS estimator, except for if CB type DR or DM…
-
This is the meta-issue tracking progress with GOLEM paper that introduces GOLEM framework with use-cases and adaptive features.
*What's required from collaborators who add their use-cases*
- Use-c…
-
Spatial correlation among frequency bands- MABs or Contextual Bandits with arm correlation in the presence of latent PUs
-
Hello world,
This is probably not a significant issue, but it can be quite confusing especially for beginners.
In the VW contextual bandits tutorial (https://vowpalwabbit.org/tutorials/contextua…
-
Hi,
I am currently dealing with "agents/tf_agents/bandits/" . I am wondering where or if the classic Contextual Bandit off-policy evaluation procedures are present in Tensorflow.I mean exactly the…
-
Formulate the problem for a simple topology: 1 PU and 1 SU. Do the math showing the approach to tackle the learning problem or action space approximation problem by using the correlation between bands…
-
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004967 via @sharkovsky
This thread is to collect links and discussions related to the ideas in the paper.
First thoughts: […