-
`v_array` uses [`std::realloc`](https://en.cppreference.com/w/cpp/memory/c/realloc) to obtain better performance than `std::vector` in many places. However, as `realloc` can copy the piece of memory w…
-
https://github.com/VowpalWabbit/vowpal_wabbit/blob/3f5793d9920fdff44f674cd4252180411da1cff5/vowpalwabbit/conditional_contextual_bandit.cc#L113-L120
In debug mode, when the number of slots input to …
-
### Problem Description
I have trained an online contextual bandit using `--save_resume`. I was attempting to debug interaction coefficients, and unfortunately `--invert_hash` for `cb_explore_adf` w…
-
-
We now have `SuccessiveHalvingClassifier` and `SuccessiveHalvingRegressor` in the `model_selection` to module to perform, well, model selection. This allows doing hyperparameter-tuning by initializing…
-
Hi team,
Here's the code that trains and saves a Bandit policy
```
import numpy as np
import tensorflow as tf
from tf_agents.bandits.agents import lin_ucb_agent
from tf_agents.bandits.enviro…
-
I'm getting the following error when running an `Exp3MixtureAgent` with two agents:
```
File "/Users/michalw/git/tf-agents-contextual-bandits/.venv-tf-agents-contextual-bandits/lib/python3.7/site-p…
waral updated
4 years ago
-
Follow up to #6253. The issue is still reproducible.
## Description
While we currently have a feedback tool for ads which will limit delivery of downvoted ads, there have been many requests to li…
-
-
I am using Vowpal Wabbit's contextual bandit to rank various action given a context.
Train Data:
"1:10:0.1 | 123"
"2:9:0.1 | 123"
"3:8:0.1 | 123"
"4:7:0.1 | 123"
"5:6:0…