RatInABox-Lab / RatInABox

A python package for modelling locomotion in complex environments and spatially/velocity selective cell activity.
MIT License

Hide and Seek In A Box? #66

Closed (aadharna closed 1 year ago)

aadharna commented 1 year ago

I just stumbled upon this from your gymnasium compliance tweet.

I was wondering how feasible it would be to combine the field of view, multi-agent, and new gym wrapper classes to create a hide-and-seek environment similar to MuJoCo-based works (with bonus points if you could also move walls around) using the tools here.

If not, and if this isn't something you'd have interest in having in here, no worries.

TomGeorge1234 commented 1 year ago

This would be awesome. I'd definitely put it on the "harder" side of the spectrum but totally within scope, particularly if you know what you're doing.

Some of this isn't natively supported but wouldn't be hard to do. For example, Agents moving walls around isn't supported, but you could totally reach in, access the walls parameter, then manually adjust one wall by a small amount depending, say, on the Agent's position/velocity.
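To make the wall-nudging idea concrete, here is a rough sketch in plain Python. It is not RatInABox code: it assumes the walls are held as a list of line segments `[[x0, y0], [x1, y1]]`, and the helper `nudge_wall` with its `dt` and `reach` parameters is hypothetical, made up purely for illustration. In practice you would read the segments out of the Environment, adjust them, and write them back.

```python
import math


def nudge_wall(walls, index, agent_pos, agent_vel, dt=0.1, reach=0.1):
    """Return a copy of `walls` in which wall `index` has been pushed.

    Hypothetical helper, not part of RatInABox. The wall is translated by
    `agent_vel * dt` only if the agent is within `reach` of its midpoint.
    """
    (x0, y0), (x1, y1) = walls[index]
    mid_x, mid_y = (x0 + x1) / 2, (y0 + y1) / 2
    dist = math.hypot(agent_pos[0] - mid_x, agent_pos[1] - mid_y)

    # Copy every segment so the caller's wall list is left untouched.
    new_walls = [[list(point) for point in wall] for wall in walls]

    if dist <= reach:
        # Small displacement in the direction the agent is moving.
        dx, dy = agent_vel[0] * dt, agent_vel[1] * dt
        new_walls[index] = [[x0 + dx, y0 + dy], [x1 + dx, y1 + dy]]
    return new_walls


# One vertical wall; the agent is just left of its midpoint, moving right.
walls = [[[0.5, 0.0], [0.5, 0.4]]]
pushed = nudge_wall(walls, 0, agent_pos=(0.45, 0.2), agent_vel=(0.2, 0.0))
```

Calling something like this once per simulation step, with the Agent's current position and velocity, would give a crude "pushable wall". Which wall to move and how close the Agent has to be are design choices left open here.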

If you got it working I'd be thrilled to have it here as a contrib/demo. I'm sure people would use it. Happy to chat further if you're seriously considering this.

aadharna commented 1 year ago

Definitely interested and serious about it!

With how my schedule looks at the moment I probably won't get to this for a couple of weeks, but once my agents can crush my current environment, H&S is the logical next domain.

TomGeorge1234 commented 1 year ago

Are your current Agents in RiaB or something else?

Yeah, take your time, but I'd be excited to support this. Get back in touch when you're ready/if you need support.

aadharna commented 1 year ago

My agents are just PyTorch NNs, with no notion of firing rates or biologically inspired agents, which is what your project here focuses on. I do pure deep RL (and for this project am training agents via historical self-play).

My environments are a mix of homemade games (a weird one-step cooperative multi-agent game, and an iterated competitive matrix game) and board games (Connect 4, Go, etc.) right now.

A stretch goal for the research is to get something similar to the OpenAI H&S domain I linked above, but that's not the focus at the moment while I iterate on the main scientific contributions in simple environments.

TomGeorge1234 commented 9 months ago

Hi Aaron, I just remembered this discussion. Did you ever move forward with it? I was wondering how you got on.

aadharna commented 9 months ago

Hey Tom!

I am still interested; however, my advisor told me to put that on hold until after I finish up with quals. So, no movement yet.

TomGeorge1234 commented 9 months ago

Awesome - well, good luck with those quals and keep an eye on things over here. We're beginning work on RiaB2.0 (#60), and dynamic environments (where it would be possible to move walls around and play hide and seek) are on the list of to-dos.