daisatojp / mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
GNU General Public License v3.0
70 stars 19 forks source link

Is there a single book or online course you would recommend to begin to comprehend this repo? #14

Open MotorCityCobra opened 2 years ago

MotorCityCobra commented 2 years ago

I know basic stats and calculus but I can't match anything from the paper to anything in this repo. I can't even pin point which part is the a posteriori in the paper.
It is above my head. The code looks great and I like the use of the symbols.
I would like to get to know it up close to use it as a multi objective MPO, (MOMPO) and RL algorithms beyond.
Thank you.