PWhiddy / PokemonRedExperiments

Playing Pokemon Red with Reinforcement Learning
MIT License
7k stars 645 forks source link

Algorithms used? #110

Closed DiAvisoo closed 1 year ago

DiAvisoo commented 1 year ago

Love this project!

Mind adding a section in the README about what algorithms used to get a quick overview of how this works? I'm gonna look at the code but would be very convensient to get some info about this.

PWhiddy commented 1 year ago

Hi! Yes that would be great to have in the readme. Hoping to add this as well as some standard baseline results tables in the not so far future. For now, short answer is that it uses PPO with fairly normal hyperparameters. You can see these here: https://github.com/PWhiddy/PokemonRedExperiments/blob/624e6f08d284a498d56e073e4434cd8053fab9a3/baselines/run_baseline_parallel_fast.py#L62

Rewards are a bit more complicated, video describes some of this but you'll have to look at the code for more details for now.