Open karalets opened 4 years ago
@karalets are you still working on this?
yeah I can revisit this pretty soon, I had a lot of this ready but got sidetracked. Thanks for the ping.
Hi @karalets, would be also very excited to see your work implemented in Pyro.
This issue proposes a pyro.contrib.agents module for building agent-based models and using control to guide their actions to maximize rewards.
This modeling task is commonly described as model-based reinforcement learning and will have the structure of a Markov Decision Process (MDP). We will use some previous publications (in particular NIPS2018WS and ICML2019WS) which incorporate latent structure into agent- and reward-models to guide this contrib and aim to iterate on versions of these models tutorial-style.
The underlying aim is to show how the model-control-exploration loop works in pyro and to use the latent variable models to demonstrate how transferrable structure can be learned and used for agent models.
Tasks