aribis369 / ML-Starter-Pack

A collection of Machine Learning algorithms written from scratch.
75 stars 52 forks

UCB(Reinforcement Learning) #51

Closed devanshb26 closed 6 years ago

devanshb26 commented 6 years ago

Upper Confidence Bound (UCB) learning. The given code finds which of 10 ads should be displayed on a website to maximize the click-through response from users. The dataset is simulated: it shows what the ith user would have done if each of the 10 ads had been shown to them, where 1 means the user would have clicked the ad and 0 means they would have ignored it. The three steps of the mathematics behind the algorithm are included for better understanding.
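The submitted code is not reproduced in this thread, so the following is only a minimal sketch of how UCB ad selection is commonly written, not the contributor's implementation. It assumes the standard UCB1 bound (average reward plus `sqrt(2 * ln(n) / N_i)`); the function name `ucb_select_ads`, the exploration constant, and the simulated click probabilities in the demo are all hypothetical.

```python
import math
import random


def ucb_select_ads(rewards, n_ads):
    """Run UCB1 over a table of simulated clicks.

    rewards[n][i] is 1 if user n would click ad i, else 0.
    Returns (chosen, counts, sums): the ad shown in each round, how often
    each ad was shown, and the total clicks each ad collected.
    """
    counts = [0] * n_ads   # N_i: times ad i has been shown so far
    sums = [0] * n_ads     # R_i: total reward collected by ad i
    chosen = []

    for n, row in enumerate(rewards, start=1):
        best_ad, best_bound = 0, -1.0
        for ad in range(n_ads):
            if counts[ad] == 0:
                # An untried ad gets an infinite bound so it is shown once.
                bound = float("inf")
            else:
                # Step 1: average reward observed for this ad.
                mean = sums[ad] / counts[ad]
                # Step 2: confidence interval half-width (assumed UCB1 form).
                delta = math.sqrt(2.0 * math.log(n) / counts[ad])
                bound = mean + delta
            # Step 3: keep the ad with the largest upper confidence bound.
            if bound > best_bound:
                best_ad, best_bound = ad, bound

        counts[best_ad] += 1
        sums[best_ad] += row[best_ad]
        chosen.append(best_ad)

    return chosen, counts, sums


if __name__ == "__main__":
    # Hypothetical click probabilities, used only to simulate a dataset.
    random.seed(0)
    probs = [0.05, 0.10, 0.02, 0.25, 0.08, 0.03, 0.12, 0.04, 0.06, 0.01]
    data = [[1 if random.random() < p else 0 for p in probs]
            for _ in range(5000)]
    _, counts, sums = ucb_select_ads(data, n_ads=len(probs))
    print("times each ad was shown:", counts)
    print("total clicks collected:", sum(sums))
```

In each round only the reward of the ad that was actually shown is read from the table, which mirrors the real setting where the outcome for the other nine ads would be unknown.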

aribis369 commented 6 years ago

@devanshb26 Nice job.

aribis369 commented 6 years ago

Merged.