alexcbb / Genie-Generative-Interactive-Environments

This repo aims to reproduce and openly release the results of "Genie: Generative Interactive Environments" by Google DeepMind.
MIT License

Unofficial Genie: Generative Interactive Environments

[[Project Page]]()


💪 Goal

This repo aims to reproduce and openly release the results of "Genie: Generative Interactive Environments" by Google DeepMind.

I'm currently looking for people interested in reproducing the paper's results at a smaller scale. Don't hesitate to contact me if you'd like to work on it together: alexandre.chapin@ec-lyon.fr

📰 News

[2024.03.09] Beginning of the project

🛠️ Installation

Create a Python venv or a Conda environment using Python 3.10. Inside the environment, install torch and torchvision following the instructions for your system on the PyTorch website.

Then install the remaining dependencies:

python -m pip install -r requirements.txt
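To confirm the environment matches the steps above before running anything heavier, a small check like the following can help. This is a sketch rather than part of the repo: the helper names `missing_packages` and `python_matches` are made up for illustration, and the script only checks that the packages can be found, not that they work with your GPU or driver.

```python
import importlib.util
import sys


def missing_packages(names):
    """Return the top-level package names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_matches(major=3, minor=10):
    """Check that the interpreter is the Python version the README asks for."""
    return sys.version_info[:2] == (major, minor)


if __name__ == "__main__":
    if not python_matches():
        print(f"Warning: expected Python 3.10, found {sys.version.split()[0]}")
    missing = missing_packages(["torch", "torchvision"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("torch and torchvision are installed")
```

Run it with the interpreter from your venv or Conda environment so that it inspects the right set of installed packages.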

🔎 Unit tests

Run the tests using:

pytest tests/

As the project grows, more tests will be added, and you may want to run only the subset related to your changes by using pytest's -k option. Running tests in parallel with the -n option (provided by the pytest-xdist plugin; 4 processes in the example below) may also help:

pytest -k "substring-to-match" -n 4 tests/
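As an illustration of how `-k` selects tests by name, here is a minimal sketch of a test module. The file name `tests/test_patches.py`, the helper `num_patches`, and both test names are hypothetical and are not taken from this repository; the helper is defined inline so the file runs on its own.

```python
# tests/test_patches.py  (hypothetical file, not part of this repo)


def num_patches(height: int, width: int, patch_size: int) -> int:
    """Count the non-overlapping square patches that tile an image exactly."""
    if patch_size <= 0:
        raise ValueError("patch_size must be positive")
    if height % patch_size or width % patch_size:
        raise ValueError("image size must be divisible by patch_size")
    return (height // patch_size) * (width // patch_size)


def test_num_patches_counts_tiles():
    # A 64x64 frame cut into 16x16 patches gives a 4x4 grid.
    assert num_patches(64, 64, 16) == 16


def test_num_patches_rejects_uneven_split():
    # Plain try/except keeps this sketch free of any pytest import.
    try:
        num_patches(64, 60, 16)
    except ValueError:
        return
    raise AssertionError("expected ValueError for a non-divisible width")
```

With such a file in place, `pytest -k "num_patches" tests/` would run both tests, while `pytest -k "rejects" tests/` would run only the second one.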

✅ TODO list:

Setup the codebase

Conduct experiments

📂 Repo structure (WIP)

├── README.md
├── CONTRIBUTION.md
├── scripts                        -> All scripts.
├── genie
│   ├── dataset
│   ├── models
│   │   ├── ae
│   │   │   ├── vae
│   │   │   └── vqvae
│   │   ├── stt
│   │   │   ├── mask_git
│   │   │   └── st_transformer
│   │   ├── video_tokenizer
│   │   ├── dynamic_model
│   │   ├── latent_action_model
│   │   └── genie_model
│   ├── train                      -> Training code
│   └── utils

🀝Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

👏 Acknowledgment

This project builds upon the work of Google DeepMind's research team:

@misc{bruce2024genie,
      title={Genie: Generative Interactive Environments}, 
      author={Jake Bruce and Michael Dennis and Ashley Edwards and Jack Parker-Holder and Yuge Shi and Edward Hughes and Matthew Lai and Aditi Mavalankar and Richie Steigerwald and Chris Apps and Yusuf Aytar and Sarah Bechtle and Feryal Behbahani and Stephanie Chan and Nicolas Heess and Lucy Gonzalez and Simon Osindero and Sherjil Ozair and Scott Reed and Jingwei Zhang and Konrad Zolna and Jeff Clune and Nando de Freitas and Satinder Singh and Tim Rocktäschel},
      year={2024},
      eprint={2402.15391},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

We also built upon the following repositories for the different parts of our models:

🔒 License

This project is licensed under the MIT license - see the LICENSE file for details.