-
The example in https://github.com/VowpalWabbit/vowpal_wabbit/wiki/Logged-Contextual-Bandit-Example assumes one knows the action probabilities. However, in many cases, these probabilities are unknown a…
-
At first, thank you a lot for your contributions. They are very valuable to improve my understanding of the original paper.
I have a fundamental question regarding the implementation of the [Neural…
-
I found an error if `beta_prior` is set to be `"auto"` and `nchoices` is a `list` when we initialize the online contextual bandits.
Here is the error message:
```
File "/usr/local/lib/python3.…
-
Hi Mark,
We currently use the `LoggedInteraction`'s IPS estimator to compare the accumulated reward of VW models with non-VW baselines, such as the random policy, to analyze if there's something fo…
-
Hello @david-cortes, thanks for this Contextual Bandits package.
While using some of the online methods (BootstrappedTS, AdaptiveGreedy, maybe some others) from this package, I've faced some unexpe…
-
Hi!
When passing context features as a dictionary with the keys being their names they don't seem to be processed properly.
While stepping through the code I believe the issue is in `_prep_namespac…
-
### What happened + What you expected to happen
I want to train on custom contextual bandits with multi discrete actions, but torch throws an error because of unexpected tensor-shapes:
Failure…
-
**Submitting author:** @N-Wouda (Niels Wouda)
**Repository:** https://github.com/N-Wouda/ALNS
**Branch with paper.md** (empty if default branch): joss-paper
**Version:** v5.0.4
**Editor:** @hugoledoux…
-
- News
- Deadline
- Interspeech 2022, 수고 많으셨습니다!
- ICML 22: Review out (4. 7, 저녁)
- [인공지능과 지식재산백서](https://www.kipo.go.kr/ko/kpoBultnDetail.do?ntatcSeq=16558&aprchId=BUT0000048&searchC…
-
Hi everyone,
I am working on solving a peg-in hole problem. Initially I stared with RL approach but it seems like its not the right approach for my problem.
**Task description**
**Setup**: Rob…