bandit: Add epsilon-decreasing option

jubatus / jubatus_core

Jubatus algorithm component

GNU Lesser General Public License v2.1

20 stars 29 forks source link

bandit: Add epsilon-decreasing option #354

Closed imaimai1125 closed 7 years ago

imaimai1125 commented 7 years ago

A fixed epsilon, the constant factor in a simple epsilon-greedy strategy, prevents the strategy from getting arbitrarily close to the optimal lever.

To solve this problem, some paper suggest natural variant of epsilon-greedy strategy, which is called "epsilon-decreasing strategy". (Ref1(Auer et al.), Ref2(Vermorel et al.))

It may be good to add an option of decreasing epsilon to the epsilon-greedy method.

kmaehashi commented 7 years ago

Implemented via PRs above.