Long term - Githubissues

Ladder nets instead of AE; consider attention mechanisms here. Ladder impls (paper):

Use the model f that we learned (model based-rl algs, can we apply to MPC? -> need to find industrial-strength MPC algo).

mwhittaker / deeprl_project