Farama-Foundation / Minigrid

Simple and easily configurable grid world environments for reinforcement learning
https://minigrid.farama.org/
Other
2.13k stars 611 forks source link

Babyai text #454

Open BartekCupial opened 3 weeks ago

BartekCupial commented 3 weeks ago

Description

Before I will do the checklist I wanted to ask for feedback, do the community think that this change is useful and want me to continue?

Added BabyAI-Text environments to Minigrid, which provide textual descriptions of observations instead of symbolic representations. This extends the BabyAI platform to enable training language models as agent policies in interactive environments. The new environments maintain the same underlying mechanics as original BabyAI tasks but return text observations describing the agent's partial view of the environment.

Code is copied and adapted from https://github.com/flowersteam/Grounding_LLMs_with_online_RL

Key additions:

This addition enables research on grounding language models in interactive environments while maintaining the sample efficiency.

Type of change

Please delete options that are not relevant.

Screenshots

Example observation: Screenshot_20241031_190026 Corresponding description:

[   
    'You see a wall 2 steps right', 
    'You see a blue key 3 steps left and 1 step forward', 
    'You see a red box 3 steps left', 
    'You see a blue box 2 steps left and 1 step forward', 
    'You see a red ball 1 step left and 1 step forward', 
    'You see a grey ball 1 step forward', 
    'You see a blue box 1 step right and 1 step forward'
]

Checklist: