contextual-bandits Search Results

Alanthink/banditpylib #15

Contextual Bandits?

Is contextual bandits in the scope of this library? Here is a paper for reference: https://arxiv.org/pdf/1810.09558.pdf

htcml updated 3 years ago

VowpalWabbit/vowpal_wabbit #1542

progressive validation with Contextual Bandits

Hi, There are several usage questions with Contextual Bandits, that I'd be happy to incorporate in the Wiki and stackoverflow. 2. is progressive validation applied when training with IPS? I'm no…

matanox updated 4 years ago

VowpalWabbit/vowpal_wabbit #4495

Inconsistent and unclear loss calculation for contextual ban…

The loss calculation for CB reductions is not consistent and not well documented. The current situation is: - `cb_adf` records loss as calculated by an IPS estimator, except for if CB type DR or DM…

jackgerrits updated 1 year ago

py-why/EconML #922

Verbose logging in LinearDML and SparseLinearDML

I'm trying to fit LinearDML and SparseLinearDML on a marketing data set with 350,000 examples, 25 real valued treatment variables, and 50 nuisance variables. The fitting takes a long time (and eventua…

carl-offerfit updated 1 month ago

VowpalWabbit/vowpalwabbit.github.io #147

Displaying train and test data

Hello world, This is probably not a significant issue, but it can be quite confusing especially for beginners. In the VW contextual bandits tutorial (https://vowpalwabbit.org/tutorials/contextua…

guijoe updated 4 years ago

tensorflow/agents #791

Contextual Bandit Off-Policy Evaluation

Hi, I am currently dealing with "agents/tf_agents/bandits/" . I am wondering where or if the classic Contextual Bandit off-policy evaluation procedures are present in Tensorflow.I mean exactly the…

vitorkrasniqi updated 1 year ago

Purdue-University-ECE-CNSIP/research #1

Discussion: Spatial Correlation among bands

Spatial correlation among frequency bands- MABs or Contextual Bandits with arm correlation in the presence of latent PUs

bkeshava updated 6 years ago

SforAiDl/genrl #196

Usage explanatory docs

Go to the `docs/source/usage/tutorials` and add separate `.md` files to explain the following: - [x] Using A2C (@Darshan-ko ) - [ ] Using PPO1 - [x] Using VPG (@Devanshu24 ) - [ ] Using DQN(s) - …

sampreet-arthi updated 4 years ago

wildtreetech/advanced-comp-2017 #9

Reservoir computing

http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004967 via @sharkovsky This thread is to collect links and discussions related to the ideas in the paper. First thoughts: […

betatim updated 7 years ago

tensorflow/agents #737

How to use the replay buffer in tf_agents for contextual ban…

I am using the tf_Agents library for contextual bandits usecase. In this usecase predictions (daily range between 20k and 30k predictions, 1 for each user) are made daily (multiple times a day) a…

tejavenkatk updated 2 years ago

130 results for contextual-bandits

130 results
for contextual-bandits