awjuliani / Meta-RL

Implementation of Meta-RL A3C algorithm
MIT License
400 stars, 109 forks

Why remove n-steps function? #4

Open RozenAstrayChen opened 5 years ago

RozenAstrayChen commented 5 years ago

Hello, I have a question. If I use n-step updates on the episode buffer, would it work better?

I think off-policy would work better than on-policy. Sorry, I don't understand meta-learning yet; I just want to ask your opinion.
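To make the question concrete: an "n-step update" here presumably means cutting the episode buffer into segments of n transitions and bootstrapping each segment's return from the critic's value estimate of the state that follows it. The sketch below is only an illustration of that idea and is not taken from this repository. The function name `n_step_returns` and its parameters are hypothetical.

```python
from typing import List


def n_step_returns(rewards: List[float],
                   bootstrap_value: float,
                   gamma: float = 0.99) -> List[float]:
    """Discounted returns for one rollout segment of n transitions.

    `bootstrap_value` is assumed to be the critic's estimate V(s_n) of the
    state following the last reward, or 0.0 if the episode ended there.
    """
    returns = [0.0] * len(rewards)
    running = bootstrap_value
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


if __name__ == "__main__":
    # Segment of 3 rewards, terminal afterwards, so no bootstrap value.
    print(n_step_returns([1.0, 1.0, 1.0], bootstrap_value=0.0, gamma=0.5))
```

With a full episode in the buffer and `bootstrap_value=0.0`, this reduces to ordinary Monte Carlo returns, so the n-step and full-episode variants differ only in where the segment is cut and whether a value estimate is used at the cut.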