MiniMax Algorithm? - Githubissues

coax-dev / coax

Modular framework for Reinforcement Learning in python

MIT License

168 stars 17 forks source link

Hi @flaport

First of all thanks for your interest in coax!

It would be great to see multi-agent style setups in coax. I haven't thought much about it, to be honest.

The simplest setup would be to use separate policies and either update the policies individually or write your own policy objective that updates multiple policies at the same time.

Having said that, I'm not an expert in multi-agent RL myself, so I'm not aware of all the subtleties associated with such a setup.

But of course, I welcome contributions and I'm curious to see what you come up with!

coax-dev / coax

MiniMax Algorithm? #30