-
Thanks for sharing the demo. I had some questions regarding the RNN-MDN module used to sample zt+1.
Since the latent sample is 32-dimensional for the car racing experiment, does the MDN model each…
-
Thinks your answer. When I try to train the MDN-RNN, some problem that I have never seen occur.I think the chainer and cupy version may be the vital reseason.Is it?
-
## 一言でいうと
強化学習において環境のシミュレーター(World Model)とエージェントの操作(Controller)を分けて考えたモデル。環境は画面の表現をVAEで、時系列の遷移をRNNで学習(次時刻におけるVAEの潜在表現zの分布を予測する)、操作側はVAEの潜在表現とRNNの隠れ層を結合して重みをかけるだけというシンプルさ。
シミュレーターとコントローラーは別個に学習する…
-
Is an update for TensorFlow 2 being considered?
-
Four keypoints:
1. Learning Dynamics.
- Firstly try: Ensemble Dynamics using MSE. (MB-MPO)
- Then, try RNN.
- Model uncertainty: MDN or GANonZ , on 1234 dataset.
- Afterwards: Use …
-
Hi @hardmaru , thanks again for the great work in making this paper accessible and reproducible!
I have a question about the predictions made by M:
In the dreams, it seems M is able to produce qui…
-
hi ctallec,
In file mdrnn.py:
I observed the neural number in gmm_linear layer is too few, why is the output size defined as (2 * latents + 1) * gaussians + 2? Shouldn't it be 3 * latents * gaussian…
-
According to my understanding, the controller C at time t has two inputs. zt and ht, now ht is the prediction produced by M from zt-1, ht-1 and at-1.
So basically, the controller makes its predict…
-
Hi,
I am using OpenPilot to conduct some experiments related to vehicle security. This article helps a lot! I hope to know whether you have any idea about how the NN get the prob for the left and ri…
-
Amazing work! I had a practical question about the time it took to train these models on the setup you described in the article. Would you be able to share more? In addition, would this repository be …