kfoofw / bandit_simulations

Bandit algorithms simulations for online learning
79 stars 33 forks source link

Bandit_simulations

Bandit algorithms simulations and analysis for online learning

This repo is part of my interest to learn more about optimisation for online learning algorithms which are heavily centerd on bandit theory. Based on what I understand, there are different types of bandit problems:

This repo is segmented into both Python and R.

Analysis and Code Implementation

Phase 1 MAB analysis includes:

Phase 2 CB analysis (Currently ongoing):

Special Mention

A portion of the MAB code is based on the book "Bandit Algorithms for Website Optimization" by John Myles White.

Microsoft's vowpal wabbit package for Python can be found in this Github repo.

The R package for contextual can be found in this Github repo.