[[Project Page]]()
This repo aims to reproduce and open the results obtained from "Generative Interactive Environments" of Google DeepMind.
I'm currently trying to find people interested in reproducing the papers results at a smaller scale. Don't hesitate to contact me to work together on it : alexandre.chapin@ec-lyon.fr
[2024.03.09] Beginning of the project
Create a Python venv or a Conda environment using Python 3.10. Inside of the environment install torch
and torchivision
using the instructions matching your system as listed on the Pytorch website.
Then install the remaining dependencies:
python -m pip install -r requirements.txt
Run the tests using :
pytest tests/
While the project will grow, more test will be added and you'll maybe need to just select a subset of tests related to the changes you made by using the -k
option of pytest
. Running tests in parallel (in the example 4 processes) with the -n
option may help :
pytest -k "substring-to-match" -n 4 tests/
βββ README.md
βββ CONTRIBUTION.md
βββ scripts -> All scripts.
βββ genie
β βββ dataset
β βββ models
β β βββ ae
β β β βββ vae
β β β βββ vqvae
β β βββ stt
β β β βββ mask_git
β β β βββ st_transformer
β β βββ video_tokenizer
β β βββ dynamic_model
β β βββ latent_action_model
β β βββ genie_model
β βββ train -> Training code
β βββ utils
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
This project builds upon the work from Google DeepMind's research team :
@misc{bruce2024genie,
title={Genie: Generative Interactive Environments},
author={Jake Bruce and Michael Dennis and Ashley Edwards and Jack Parker-Holder and Yuge Shi and Edward Hughes and Matthew Lai and Aditi Mavalankar and Richie Steigerwald and Chris Apps and Yusuf Aytar and Sarah Bechtle and Feryal Behbahani and Stephanie Chan and Nicolas Heess and Lucy Gonzalez and Simon Osindero and Sherjil Ozair and Scott Reed and Jingwei Zhang and Konrad Zolna and Jeff Clune and Nando de Freitas and Satinder Singh and Tim RocktΓ€schel},
year={2024},
eprint={2402.15391},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
We also built upon the following repositories for the different parts of our models :
This project is licensed under the MIT license - see the LICENSE file for details.