Can I reproduce the results of NeurIPS 2019 using this code?

HumanCompatibleAI / overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

MIT License

714 stars, 152 forks

Hi there. I believe the differences you see arise because the reward in the figure above may be the dense shaped reward rather than the sparse reward. The sparse reward is what is reported in the original paper and in the plots here, which were generated using the src/human_aware_rl/ppo/run_experiments.sh script.
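To make the distinction concrete, here is a minimal sketch of how the two returns can diverge for the same episode. It does not use the overcooked_ai API: the function name `episode_returns`, the `shaping_factor` parameter, and the per-step reward values are all hypothetical and chosen only for illustration.

```python
from typing import Sequence, Tuple


def episode_returns(
    sparse_rewards: Sequence[float],
    shaped_rewards: Sequence[float],
    shaping_factor: float = 1.0,
) -> Tuple[float, float]:
    """Return (sparse_return, dense_return) for one episode.

    Hypothetical helper, not part of overcooked_ai. The dense return is
    assumed here to be the sparse return plus a weighted sum of the
    per-step shaping bonuses.
    """
    sparse_return = sum(sparse_rewards)
    dense_return = sparse_return + shaping_factor * sum(shaped_rewards)
    return sparse_return, dense_return


# Hypothetical 6-step episode: a single task reward at the end (sparse),
# plus small intermediate shaping bonuses along the way.
sparse = [0, 0, 0, 0, 0, 20]
shaped = [3, 0, 3, 5, 0, 0]

sparse_return, dense_return = episode_returns(sparse, shaped)
print("sparse return:", sparse_return)
print("dense shaped return:", dense_return)
```

If a training curve tracks the dense quantity while the paper's plots track the sparse one, the two can differ noticeably even when the underlying agent behavior is the same.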

The code has changed in various ways since the NeurIPS 2019 version, but the plots at the link above were our attempt to show that these changes do not significantly affect the final results.

As long as you are able to reproduce the results from our new figure in the README (which you should be able to do using the src/human_aware_rl/ppo/run_experiments.sh script), I encourage you to use the newer version of the code, as it will be nicer to work with than the neurips2019 branch.

Let me know if you have any additional questions, or run into any issues.

Can I reproduce the results of NeurIPS 2019 using this code? #143