SulRash / envenc

Repository for environment encoder, an attempt at improving reinforcement learning agents' generalisability through learning how to act on universal multimodal embeddings generated by a vision-language model.
MIT License
2 stars 0 forks source link

Migrate to CleanRL #1

Closed SulRash closed 5 months ago

SulRash commented 5 months ago

Migrating to cleanrl would make my life a lot easier.

There is currently a branch for this and it's on-going

SulRash commented 5 months ago

Since cleanrl uses envpool I found out that the following preprocessing happens to the image:

Need to take this into account during dev

SulRash commented 5 months ago

Slight issue... I can't figure out how to modify envpool such that each environment gets its frames processed by the VLM. To avoid this and accelerate development, I've limited the number of environments to 1 but I can see this causing a lot of issues down the line. Maybe I should switch to the non-envpool version for now?

SulRash commented 5 months ago

Reading the docs of cleanrl they have these 2 hacks they did for Atari games:

A CNN won't make sense for the embeddings and scaling the image won't make sense either so I need to disable both options.