rickytan-AA / rickysrivalchessbot

MIT License

Environment Setup #1

Open rickytan-AA opened 3 years ago

rickytan-AA commented 3 years ago

We need to:

Helpful links:

rickytan-AA commented 3 years ago

There don't seem to be many good RL environments for chess.

That's fine, since it gives me a chance to experiment with value functions and with defining the rewards myself.

rickytan-AA commented 3 years ago

image

rickytan-AA commented 3 years ago

Should the reward be computed by the environment or by the agent?

The environment should at least return game outcomes (+1 for a win, -1 for a loss, 0 for draws).
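A minimal sketch of that outcome-only reward, assuming the environment reports the finished game as a PGN-style result string (`1-0`, `0-1`, `1/2-1/2`, or `*` for a game still in progress). The function name `terminal_reward` and the `agent_is_white` flag are hypothetical and not part of this repo:

```python
from typing import Optional

# PGN-style result strings mapped to the outcome from White's point of view.
_WHITE_OUTCOME = {"1-0": 1, "0-1": -1, "1/2-1/2": 0}


def terminal_reward(result: str, agent_is_white: bool) -> Optional[int]:
    """Return +1 (win), -1 (loss), or 0 (draw) for the agent.

    Returns None while the game is unfinished ("*"), so the caller can
    tell "no reward yet" apart from a draw.
    """
    if result == "*":
        return None
    if result not in _WHITE_OUTCOME:
        raise ValueError(f"unknown result string: {result!r}")
    white_reward = _WHITE_OUTCOME[result]
    # Flip the sign when the agent plays Black.
    return white_reward if agent_is_white else -white_reward
```

Keeping this in the environment means every agent sees the same win/loss/draw signal; any extra shaping (material, position) could then live on the agent side without changing the environment.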

I probably need to read the AlphaZero paper to understand how they defined their reward.