-
We will use linear Thompson sampling
- https://arxiv.org/abs/1209.3352
- https://web.stanford.edu/~bvr/pubs/TS_Tutorial.pdf section 7.1 is exactly what we want to do
- investigate https://github.c…
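
For reference, a minimal sketch of linear Thompson sampling, assuming one independent Bayesian linear regression per arm with a zero-mean Gaussian prior and known noise variance. The class name, method names, and the `prior_precision` / `noise_var` parameters are illustrative assumptions; this is not taken from the linked tutorial or repository.

```python
import numpy as np


class LinearThompsonSampling:
    """Hypothetical sketch: per-arm Bayesian linear regression with posterior sampling."""

    def __init__(self, n_arms, n_features, prior_precision=1.0, noise_var=1.0, seed=0):
        self.rng = np.random.default_rng(seed)
        self.noise_var = noise_var
        # Posterior precision matrix per arm, initialised to the prior precision.
        self.precision = np.stack(
            [prior_precision * np.eye(n_features) for _ in range(n_arms)]
        )
        # Accumulated sum of reward * context / noise_var per arm.
        self.b = np.zeros((n_arms, n_features))

    def posterior_mean(self, arm):
        # Solves precision @ mean = b for the posterior mean of the arm's weights.
        return np.linalg.solve(self.precision[arm], self.b[arm])

    def select(self, context):
        scores = []
        for arm in range(len(self.b)):
            cov = np.linalg.inv(self.precision[arm])
            cov = (cov + cov.T) / 2.0  # symmetrise against round-off
            # Draw one plausible weight vector from the posterior and score the context.
            theta = self.rng.multivariate_normal(self.posterior_mean(arm), cov)
            scores.append(theta @ context)
        return int(np.argmax(scores))

    def update(self, arm, context, reward):
        # Standard conjugate update for Gaussian linear regression.
        self.precision[arm] += np.outer(context, context) / self.noise_var
        self.b[arm] += reward * context / self.noise_var
```

Each round calls `select(context)` to pick an arm, observes the reward, and then calls `update(arm, context, reward)`. Exploration comes from the posterior draw itself, so there is no separate exploration parameter to tune.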
-
In https://github.com/openjournals/joss-reviews/issues/5028, @skadio wrote:
> Inspired by your alpha-UCB policy in ALNS, let me add one potential integration opportunity for future.
>
> Similar …
-
First of all, thank you for fixing my issue #62.
I want to set up my CB model with custom integer 'choice_names' so that I can use the serial numbers of the choices in my example data. I got a TypeError when …
-
Selected Weibo content
-
- I am trying to create a solution on AWS Personalize using a custom hyperparameter config.
- This is the error I am facing.
```
InvalidInputException Traceback (most recent call …
```
-
Hello all!
I have been playing with LinUCB in an attempt to set up a recipe recommendation system using historical data. I have read through the original LinUCB paper as well as http://www.gatsby.u…
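
For readers following along, here is a minimal sketch of disjoint LinUCB (one ridge regression per arm plus an upper-confidence bonus). The class name, the `alpha` default, and the `warm_start` helper for logged data are illustrative assumptions, not code from this thread. Replaying logged events this way ignores any bias in how the historical arms were chosen.

```python
import numpy as np


class LinUCB:
    """Hypothetical sketch of disjoint LinUCB with one linear model per arm."""

    def __init__(self, n_arms, n_features, alpha=1.0):
        self.alpha = alpha  # width of the confidence bonus
        # Ridge-regularised design matrix (A) and reward-weighted context sum (b) per arm.
        self.A = np.stack([np.eye(n_features) for _ in range(n_arms)])
        self.b = np.zeros((n_arms, n_features))

    def select(self, context):
        scores = []
        for A, b in zip(self.A, self.b):
            A_inv = np.linalg.inv(A)
            theta = A_inv @ b  # ridge estimate of the arm's weights
            bonus = self.alpha * np.sqrt(context @ A_inv @ context)
            scores.append(theta @ context + bonus)
        return int(np.argmax(scores))

    def update(self, arm, context, reward):
        self.A[arm] += np.outer(context, context)
        self.b[arm] += reward * context


def warm_start(policy, logged_events):
    """Feed logged (context, arm, reward) triples into the policy before serving."""
    for context, arm, reward in logged_events:
        policy.update(arm, np.asarray(context, dtype=float), reward)
    return policy
```

A larger `alpha` widens the bonus and favours arms with little data; after a warm start, arms that appear often in the log have smaller bonuses than rarely logged ones.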
-
Hi,
I noticed that in the `fit` function of `_ThompsonSampling`, `contexts` is never passed to `self._parallel_fit(decisions, rewards)`. https://github.com/fidelity/mabwiser/blob/master/mabwiser/th…
-
First of all, thank you for the code for using CB.
When I run the '3.3 Streaming models' part of your example notebook (online_contextual_bandits.ipynb), I get an 'AssertionError'. How can I get some h…
-
- Arthur Juliani. [Learning Policies For Learning Policies — Meta Reinforcement Learning (RL²) in Tensorflow](https://medium.com/hackernoon/learning-policies-for-learning-policies-meta-reinforcement-…
-
@41ow1ives @ryubright