Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which employs Bayesian inference to dynamics modeling and model predictive control (MPC)with stochastic optimization via the cross entropy method (CEM). we propose a novel extension to the uncertainty-aware MBRL
Innovation/Contribution:
Firstly, we introduce a variational inference MPC (VI-MPC),which reformulates various stochastic methods, including CEM, in a Bayesianfashion. Secondly, we propose a novel instance of the framework, called prob-abilistic action ensembles with trajectory sampling (PaETS).
Conclusion:
In comparison to PETS, our method consistently improvesasymptotic performance on several challenging locomotion tasks
Comments:
This paper is published after "Benchmarking Model-Based Reinforcement Learning" link where author got the conclusion that MBRL is sub-optimal compared with MFRL. But will this paper proves that MBRL indeed is good enough?
Link: Arxiv
Problem:
Innovation/Contribution:
Conclusion:
Comments: This paper is published after "Benchmarking Model-Based Reinforcement Learning" link where author got the conclusion that MBRL is sub-optimal compared with MFRL. But will this paper proves that MBRL indeed is good enough?