Closed gkirgizov closed 3 weeks ago
Paper Focus:
Preliminary results on Simple bandits VS. Baseline GOLEM:
Need trying contextual bandits and offline pretraining.
Experiment setup: 10-15 minutes and 5-10 trials per case. Agent with UCB algorithm (alpha=1.25)
This is the meta-issue tracking progress with GOLEM paper that introduces GOLEM framework with use-cases and adaptive features.
What's required from collaborators who add their use-cases
Pre-requisite PRs & Issues:
Experiments: