subaochen / subaochen.github.io

MIT License
1 stars 3 forks source link

policy improvement的数学证明 #76

Open subaochen opened 5 years ago

subaochen commented 5 years ago

https://subaochen.github.io/reinforcement%20learning/2019/08/21/policy-improvement-math-prove/