tesslerc / malmo_rl

MIT License
2 stars 2 forks source link

A Malmo reinforcement learning environment

Visdom graphs

Project structure

Notes

Policies out of the box

This package comes with several prebuilt policies:

C51-DQN

QR-DQN

Requirements

Running the agent

python3.6 main.py <policy> <agent> [parameters]

For instance, to automatically load Malmo:

python3.6 main.py qr_dqn single_room --number_of_atoms 200 --number_of_agents 1 --retain_rgb --save_name qr-dqn-test

A more robust solution is to open Malmo externally and provide the ports (default port for Malmo is 10000):

python3.6 main.py qr_dqn single_room --number_of_atoms 200 --number_of_agents 1 --malmo_ports 10000 --retain_rgb --save_name qr-dqn-test

Disclaimer

This codebase was built for my research, this is not an official product of any sort. I urge users to submit issues, bug reports and fixes to help make this better for everyone.

To cite this repository in publications:

@misc{malmo_rl,
  author = {Chen Tessler},
  title = {Malmo Reinforcement Learning Environment},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/tesslerc/malmo_rl/}},
}