zt95 / infinite-horizon-off-policy-estimation

13 stars 7 forks source link

Question about discrete case #3

Closed clvoloshin closed 5 years ago

clvoloshin commented 5 years ago

Hi!

In the density_ratio_estimate function, you have "x = quadratic_solver(n, G/50., regularizer)". Where does the 50. come from?

zt95 commented 5 years ago

Oh that's just avoiding overflow of large integer when you collect too much data. Maybe I can do in a cleverer way.