haje01 / gym-tictactoe

OpenAI Gym Style Tic-Tac-Toe Environment
MIT License
68 stars 36 forks source link

Values in best_td_agent #8

Open premsai1503 opened 1 year ago

premsai1503 commented 1 year ago

Hi, i was trying to develop a RL agent who can play the Tic Tac Toe game and was wondering how did you get the data in the best_td_agent.dat file. Because, it is used as the optimal policy for the your agent to follow. Did you save them after playing many episodes or is there any other methods. Your answer would be really helpful

haje01 commented 1 year ago

That file is nothing special. You can create your own file by training, evaluating and saving iteratively many times.