pyro-ppl / pyro

Deep universal probabilistic programming with Python and PyTorch
http://pyro.ai
Apache License 2.0
8.44k stars 985 forks source link

[FR] Contrib module for agent-based models and model-based RL #1964

Open karalets opened 4 years ago

karalets commented 4 years ago

This issue proposes a pyro.contrib.agents module for building agent-based models and using control to guide their actions to maximize rewards.

This modeling task is commonly described as model-based reinforcement learning and will have the structure of a Markov Decision Process (MDP). We will use some previous publications (in particular NIPS2018WS and ICML2019WS) which incorporate latent structure into agent- and reward-models to guide this contrib and aim to iterate on versions of these models tutorial-style.

The underlying aim is to show how the model-control-exploration loop works in pyro and to use the latent variable models to demonstrate how transferrable structure can be learned and used for agent models.

Tasks

Rish001 commented 4 years ago

@karalets are you still working on this?

karalets commented 4 years ago

yeah I can revisit this pretty soon, I had a lot of this ready but got sidetracked. Thanks for the ping.

BartekSzpak commented 3 years ago

Hi @karalets, would be also very excited to see your work implemented in Pyro.