-
Is contextual bandits in the scope of this library? Here is a paper for reference:
https://arxiv.org/pdf/1810.09558.pdf
htcml updated
3 years ago
-
Hi,
There are several usage questions with Contextual Bandits, that I'd be happy to incorporate in the Wiki and stackoverflow.
2. is progressive validation applied when training with IPS? I'm no…
-
The loss calculation for CB reductions is not consistent and not well documented. The current situation is:
- `cb_adf` records loss as calculated by an IPS estimator, except for if CB type DR or DM…
-
I'm trying to fit LinearDML and SparseLinearDML on a marketing data set with 350,000 examples, 25 real valued treatment variables, and 50 nuisance variables. The fitting takes a long time (and eventua…
-
Hello world,
This is probably not a significant issue, but it can be quite confusing especially for beginners.
In the VW contextual bandits tutorial (https://vowpalwabbit.org/tutorials/contextua…
-
Hi,
I am currently dealing with "agents/tf_agents/bandits/" . I am wondering where or if the classic Contextual Bandit off-policy evaluation procedures are present in Tensorflow.I mean exactly the…
-
Spatial correlation among frequency bands- MABs or Contextual Bandits with arm correlation in the presence of latent PUs
-
Go to the `docs/source/usage/tutorials` and add separate `.md` files to explain the following:
- [x] Using A2C (@Darshan-ko )
- [ ] Using PPO1
- [x] Using VPG (@Devanshu24 )
- [ ] Using DQN(s)
- …
-
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004967 via @sharkovsky
This thread is to collect links and discussions related to the ideas in the paper.
First thoughts: […
-
I am using the tf_Agents library for contextual bandits usecase.
In this usecase predictions (daily range between 20k and 30k predictions, 1 for each user) are made daily (multiple times a day) a…