contextual-bandits Search Results

128 results
for contextual-bandits

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

VowpalWabbit/vowpal_wabbit #3910

logged contextual bandits without probabilities

The example in https://github.com/VowpalWabbit/vowpal_wabbit/wiki/Logged-Contextual-Bandit-Example assumes one knows the action probabilities. However, in many cases, these probabilities are unknown a…

chanansh updated 1 year ago
7
sauxpa/neural_exploration #3

NeuralUCB Confidence

At first, thank you a lot for your contributions. They are very valuable to improve my understanding of the original paper. I have a fundamental question regarding the implementation of the [Neural…

kaposnick updated 1 year ago
3
david-cortes/contextualbandits #59

Type error if beta_prior == "auto" and nchoices is list

I found an error if `beta_prior` is set to be `"auto"` and `nchoices` is a `list` when we initialize the online contextual bandits. Here is the error message: ``` File "/usr/local/lib/python3.…

songminhak updated 1 year ago
1
VowpalWabbit/coba #38

Advanced estimator for model performance evaluation

Hi Mark, We currently use the `LoggedInteraction`'s IPS estimator to compare the accumulated reward of VW models with non-VW baselines, such as the random policy, to analyze if there's something fo…

jonastim updated 11 months ago
48
david-cortes/contextualbandits #60

Possibly unexpected behaviour of decision function

Hello @david-cortes, thanks for this Contextual Bandits package. While using some of the online methods (BootstrappedTS, AdaptiveGreedy, maybe some others) from this package, I've faced some unexpe…

Yalikesifulei updated 1 year ago
1
VowpalWabbit/coba #22

Named feature support for VW buggy

Hi! When passing context features as a dictionary with the keys being their names they don't seem to be processed properly. While stepping through the code I believe the issue is in `_prep_namespac…

jonastim updated 1 year ago
10
ray-project/ray #24075

[RLlib] Training on custom contextual bandits with multi-dis…

### What happened + What you expected to happen I want to train on custom contextual bandits with multi discrete actions, but torch throws an error because of unexpected tensor-shapes: Failure…

philippGraf updated 2 years ago
2
openjournals/joss-reviews #5028

[REVIEW]: ALNS: a Python implementation of the adaptive larg…

**Submitting author:** @N-Wouda (Niels Wouda) **Repository:** https://github.com/N-Wouda/ALNS **Branch with paper.md** (empty if default branch): joss-paper **Version:** v5.0.4 **Editor:** @hugoledoux…

editorialbot updated 1 year ago
66
jungwoo-ha/WeeklyArxivTalk #46

[20220403] Weekly AI ArXiv 만담 - 46회차

- News - Deadline - Interspeech 2022, 수고 많으셨습니다! - ICML 22: Review out (4. 7, 저녁) - [인공지능과 지식재산백서](https://www.kipo.go.kr/ko/kpoBultnDetail.do?ntatcSeq=16558&aprchId=BUT0000048&searchC…

jungwoo-ha updated 2 years ago
6
david-cortes/contextualbandits #46

Question about using contextual bandits in specific case

Hi everyone, I am working on solving a peg-in hole problem. Initially I stared with RL approach but it seems like its not the right approach for my problem. **Task description** **Setup**: Rob…

danielstankw updated 2 years ago
1

上一页 1...3 4 5 6 7 8 9...13 下一页

128 results for contextual-bandits

128 results
for contextual-bandits