Closed AdriaPerezCulubret closed 9 months ago
These are some old changes I had on a local repository from some time ago. They are old, but some are a bit substantial to AdaptiveBandit's algorithm. Also includes an option to use macrostates to compute rewards.
These are some old changes I had on a local repository from some time ago. They are old, but some are a bit substantial to AdaptiveBandit's algorithm. Also includes an option to use macrostates to compute rewards.