Generic POMDP Support - Githubissues

jonathan-laurent / AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

MIT License

1.24k stars 140 forks source link

Hello! Apologies if this has been asked elsewhere.

I remember seeing that a goal for AlphaZero.jl is to support any class of POMDP. Is there any update as to the status of that milestone? As an end user this would be a huge deal for me, though I’m not familiar enough with the ecosystem and research to implement it myself at this point.

Some useful features: multi-agent games, incomplete information, and continuous (or mixed discrete-continuous) action spaces.

Any insight on plans for these things or implementations in other package ecosystems would be much appreciated!

jonathan-laurent / AlphaZero.jl

Generic POMDP Support #212