QiXuanWang / LearningFromTheBest

This project lists the best books, courses, tutorials, and methods for learning a given subject.

Value Iteration Networks, by Aviv Tamar, Yi Wu, Garrett Thomas, Sergey Levine, Pieter Abbeel #3

Open QiXuanWang opened 4 years ago

QiXuanWang commented 4 years ago

Best Paper Award NIPS 2016.

Video: MSDN Channel 9

Main Invention: "Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network, and trained end-to-end using standard backpropagation."
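To make the quoted idea concrete, here is a minimal pure-Python sketch of value iteration written as "convolve, then max over action channels" on a small grid world. It is an illustration under assumptions, not the authors' implementation: the names `KERNELS`, `conv2d_same`, `vi_module`, and `GAMMA` are made up for this example, and the kernels are fixed by hand to deterministic moves, whereas in a VIN the convolution weights are learned by backpropagation.

```python
# Illustrative sketch only: fixed, hand-written kernels stand in for the
# convolution weights that a VIN would learn end-to-end.

GAMMA = 0.9  # discount factor (assumed value for this example)

# One 3x3 kernel per action. Each kernel selects a single neighbouring cell,
# which corresponds to a deterministic move in that direction.
KERNELS = {
    "up":    [[0, 1, 0], [0, 0, 0], [0, 0, 0]],
    "down":  [[0, 0, 0], [0, 0, 0], [0, 1, 0]],
    "left":  [[0, 0, 0], [1, 0, 0], [0, 0, 0]],
    "right": [[0, 0, 0], [0, 0, 1], [0, 0, 0]],
}


def conv2d_same(grid, kernel):
    """3x3 cross-correlation with zero padding; output has the input's shape."""
    h, w = len(grid), len(grid[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ni, nj = i + di, j + dj
                    if 0 <= ni < h and 0 <= nj < w:
                        total += kernel[di + 1][dj + 1] * grid[ni][nj]
            out[i][j] = total
    return out


def vi_module(reward, k):
    """Run k rounds of Q_a = R + GAMMA * conv(V, W_a), then V = max_a Q_a."""
    h, w = len(reward), len(reward[0])
    value = [[0.0] * w for _ in range(h)]
    for _ in range(k):
        q_maps = []  # one Q map ("channel") per action
        for kernel in KERNELS.values():
            next_value = conv2d_same(value, kernel)
            q_maps.append([
                [reward[i][j] + GAMMA * next_value[i][j] for j in range(w)]
                for i in range(h)
            ])
        # Max over the action channels gives the next value map.
        value = [
            [max(q[i][j] for q in q_maps) for j in range(w)]
            for i in range(h)
        ]
    return value


# Usage: a 3x3 grid with a reward of 1 in the top-left cell.
reward = [[1.0, 0.0, 0.0],
          [0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0]]
values = vi_module(reward, 3)
```

Because every step is a convolution followed by a max, the whole loop is differentiable almost everywhere, which is what lets the real network train its kernels with standard backpropagation.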

Comment: Although the authors found a very clever way to embed an RL problem in a CNN-like network, the approach is rarely used in published papers or projects, for some reason. By its nature it can only be applied to low-dimensional domains, though some papers extend this capability. I am not sure whether it could be combined with HRL (hierarchical reinforcement learning), or whether HRL by itself would do the job without any VIN.

QiXuanWang commented 4 years ago

GitHub repositories that implement VIN:
https://github.com/kentsommer/pytorch-value-iteration-networks#a-quick-thank-you
https://github.com/DidiBear/vin-gym-pathfinding