-
Is contextual bandits in the scope of this library? Here is a paper for reference:
https://arxiv.org/pdf/1810.09558.pdf
htcml updated
3 years ago
-
Hi,
There are several usage questions with Contextual Bandits, that I'd be happy to incorporate in the Wiki and stackoverflow.
2. is progressive validation applied when training with IPS? I'm no…
-
I'm trying to fit LinearDML and SparseLinearDML on a marketing data set with 350,000 examples, 25 real valued treatment variables, and 50 nuisance variables. The fitting takes a long time (and eventua…
-
The loss calculation for CB reductions is not consistent and not well documented. The current situation is:
- `cb_adf` records loss as calculated by an IPS estimator, except for if CB type DR or DM…
-
Spatial correlation among frequency bands- MABs or Contextual Bandits with arm correlation in the presence of latent PUs
-
Hello world,
This is probably not a significant issue, but it can be quite confusing especially for beginners.
In the VW contextual bandits tutorial (https://vowpalwabbit.org/tutorials/contextua…
-
Hi,
I am currently dealing with "agents/tf_agents/bandits/" . I am wondering where or if the classic Contextual Bandit off-policy evaluation procedures are present in Tensorflow.I mean exactly the…
-
Formulate the problem for a simple topology: 1 PU and 1 SU. Do the math showing the approach to tackle the learning problem or action space approximation problem by using the correlation between bands…
-
Go to the `docs/source/usage/tutorials` and add separate `.md` files to explain the following:
- [x] Using A2C (@Darshan-ko )
- [ ] Using PPO1
- [x] Using VPG (@Devanshu24 )
- [ ] Using DQN(s)
- …
-
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004967 via @sharkovsky
This thread is to collect links and discussions related to the ideas in the paper.
First thoughts: […