contextual-bandit Search Results

254 results
for contextual-bandit

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

VowpalWabbit/vowpal_wabbit #3363

Improve docs for cb_explore_adf to make it easier to use

### Description I try heavily to get **cb_explore_adf** working in an example application. I started with **cb_explore** that worked nicely. My data shows that the algorithm is able to explore and …

sesigl updated 3 years ago
2
projectmesa/mesa #1277

Download of JS dependencies at installation time

I have noticed there have been a series of changes that make setup.py download the JS dependencies during installation. While I appreciate the effort and am sure this was done with good intentions, **…

Corvince updated 2 years ago
33
LaunchpadAI/space-bandits #20

Model can't be initialized in linear.py using fit() if not a…

I was trying to use the `fit()` function to update a model with the rewards I got after issuing a batch of 100 actions. This was the first batch of actions that the model had issued and it was all the…

lucaslangwagen updated 3 years ago
4
david-cortes/contextualbandits #42

Reproducibility issue with current branch of library?

Hi, A group I was in tried to reproduce some of the results from the paper associated with this repo. To be honest, it was a bit more rushed than I'd like, there is probably a fair amount we co…

sroy2 updated 3 years ago
1
VowpalWabbit/vowpal_wabbit #3112

Implicit feedback for recommendation systems

### Description > I need to make a recommender based on implicit feedback, like views, page visits. Another package with ALS supports this, but that package doesn't have online (streaming) learning…

dminovski0 updated 3 years ago
3
VowpalWabbit/vowpal_wabbit #2427

Handling contextual bandit problem with continuous action sp…

### Description > After reading the documentations, I couldn't find any information regarding continuous action space. I am wondering how should I handle this and if I were to discretize the action…

hardianlawi updated 3 years ago
10
VowpalWabbit/vowpal_wabbit #3108

cb to cb_adf with multiple costs

From a gitter question #### To Reproduce input ``` 1:1:1.0 2:2 3:3 4:4 | a b c 1:1 2:2:1.0 3:3 4:4 | a b c 1:1 2:2 3:3:1.0 4:4 | a b c 1:1 2:2 3:3 4:4:1.0 | a b c 1:1 2:2:0.7 3:3 4:4 | d e f…

olgavrou updated 3 years ago
3
tensorflow/agents #592

Probability of all actions in Contextual Bandits (LinUCB)

Hi, I am looking for a way to output the probabilities for all the actions for a given context, but cant find a way to do so. The 'emit_log_prob' always gives the value 0 for the action chosen. i …

kmalhotra7 updated 3 years ago
12
st-tech/zr-obp #66

alpha_ and lambda_ are not necessary for contextual linear b…

Currently, contextual linear and logistic bandit algorithms share the same superclass `BaseContextualPolicy`. The constructor of `BaseContextualPolicy` has alpha_ and lambda_ as arguments: https:/…

Kurorororo updated 3 years ago
2
VowpalWabbit/vowpal_wabbit #1306

Questions about removing actions (arms) from the ADF learner…

I'm using VW in daemon mode with cb_explore_adf in order to implement a contextual bandit system that optimises the click-through rate of adverts. The documentation states that the ADF learning mode i…

maxpagels updated 3 years ago
5

上一页 1...11 12 13 14 15 16 17...26 下一页

254 results for contextual-bandit

254 results
for contextual-bandit