anubisthejackle / 2048-Deep-Learning

MIT License
77 stars 33 forks source link

Weights and Rewards. #16

Open anubisthejackle opened 8 years ago

anubisthejackle commented 8 years ago

I've been pondering weighting and rewards.

I'll update this issue with more research, as I come across it.

Score-based Rewards

Currently the way we offer rewards on score actually stops providing a benefit as time progresses. This is not optimal, but it was the method I chose so I could normalize a variable weight between 0 and 1.

After looking at the problem further, I realized that there actually is a finite range, a mathematical maximum, for the score, and if I were to use that as the maximum, I could then normalize the weight with an ever increasing benefit, instead of how it is now.