-
### Description
I have been trying hard to get **cb_explore_adf** working in an example application. I started with **cb_explore**, which worked nicely. My data shows that the algorithm is able to explore and …
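For readers comparing the two modes, here is a minimal sketch of how a single multiline example for `cb_explore_adf` might be assembled as text. It assumes the commonly documented ADF layout: one `shared` line for the context, one line per action, and an `action:cost:probability` label only on the action that was actually taken. The helper name `format_adf_example` and the namespaces `s` and `a` are hypothetical and not taken from the issue.

```python
def format_adf_example(shared_features, action_features,
                       chosen=None, cost=None, prob=None):
    """Build one multiline ADF example as a string (hypothetical helper).

    shared_features: list of feature strings for the context.
    action_features: list of feature lists, one per available action.
    chosen/cost/prob: index of the action taken, its observed cost, and the
    probability with which it was chosen; leave as None for prediction.
    """
    lines = ["shared |s " + " ".join(shared_features)]
    for i, feats in enumerate(action_features):
        label = ""
        if chosen is not None and i == chosen:
            # Assumption: the label sits only on the chosen action's line.
            label = f"0:{cost}:{prob} "
        lines.append(label + "|a " + " ".join(feats))
    return "\n".join(lines)


example = format_adf_example(
    ["user=tom"], [["ad=1"], ["ad=2"]], chosen=1, cost=0.0, prob=0.5
)
print(example)
```

Unlike plain `cb_explore`, where the number of actions is fixed up front, this layout lets each example carry its own set of actions with their own features.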
-
I have noticed there have been a series of changes that make setup.py download the JS dependencies during installation. While I appreciate the effort and am sure this was done with good intentions, **…
-
I was trying to use the `fit()` function to update a model with the rewards I got after issuing a batch of 100 actions. This was the first batch of actions that the model had issued and it was all the…
-
Hi,
A group I was in tried to reproduce some of the results from the paper associated with this repo.
To be honest, it was a bit more rushed than I'd like; there is probably a fair amount we co…
sroy2 updated
3 years ago
-
### Description
> I need to make a recommender based on implicit feedback, such as views and page visits. Another package with ALS supports this, but that package doesn't have online (streaming) learning…
-
### Description
> After reading the documentation, I couldn't find any information regarding continuous action spaces. I am wondering how I should handle this and if I were to discretize the action…
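One generic way to discretize a continuous action, sketched here as an illustration rather than as the library's recommended approach, is to split the range into equal-width bins, let the bandit choose a bin index, and map the index back to the bin's midpoint. The function names and the `[low, high]` range are assumptions made for this example.

```python
def discretize(value, low, high, n_bins):
    """Map a continuous value in [low, high] to a bin index in 0..n_bins-1."""
    if not low < high or n_bins < 1:
        raise ValueError("need low < high and n_bins >= 1")
    clipped = min(max(value, low), high)   # clamp out-of-range values
    width = (high - low) / n_bins
    index = int((clipped - low) / width)
    return min(index, n_bins - 1)          # value == high falls in the last bin


def bin_center(index, low, high, n_bins):
    """Map a bin index back to a representative continuous action."""
    width = (high - low) / n_bins
    return low + (index + 0.5) * width


# A bandit over 4 discrete actions standing in for an action in [0, 1].
chosen_bin = discretize(0.5, 0.0, 1.0, 4)
action_value = bin_center(chosen_bin, 0.0, 1.0, 4)
```

The number of bins trades resolution against sample efficiency: more bins approximate the continuous range more closely, but each one receives fewer observations.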
-
From a gitter question
#### To Reproduce
input
```
1:1:1.0 2:2 3:3 4:4 | a b c
1:1 2:2:1.0 3:3 4:4 | a b c
1:1 2:2 3:3:1.0 4:4 | a b c
1:1 2:2 3:3 4:4:1.0 | a b c
1:1 2:2:0.7 3:3 4:4 | d e f…
```
-
Hi,
I am looking for a way to output the probabilities for all the actions for a given context, but can't find a way to do so. The 'emit_log_prob' always gives the value 0 for the action chosen. I …
-
Currently, contextual linear and logistic bandit algorithms share the same superclass `BaseContextualPolicy`.
The constructor of `BaseContextualPolicy` has `alpha_` and `lambda_` as arguments:
https:/…
-
I'm using VW in daemon mode with cb_explore_adf in order to implement a contextual bandit system that optimises the click-through rate of adverts. The documentation states that the ADF learning mode i…