instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Apache License 2.0
709 stars 83 forks source link

feat: rec iql (OUTDATED) #1048

Closed lbeyers closed 6 months ago

lbeyers commented 7 months ago

What?

(Draft) Adding a first version of recurrent IQL that performs very well on smax.

Why?

The first version of Q-learning must be added to MAVA so that the rest of the versions can be built off it.

How?

The recurrent IQL file as well as all of its config structure is now running in a periodically-kept-up-to-date "develop" base.

Extra: I am still working on...

The IQL implementation file still needs:

lbeyers commented 6 months ago

Thanks @WiemKhlifi ! Sorry to say this now only, but this branch is out of date - a lot of the strictly q-learning things are the same but I will need to put out a PR for a different branch when it comes to it!!

lbeyers commented 6 months ago

Updated rec IQL will be going in a different PR - tomorrow!