sheldon123z / gitalk_comment

store comments of my website
0 stars 0 forks source link

22.2 主动增强学习 Active Reinforcement Learning | 奔三啦 #3

Open sheldon123z opened 2 years ago

sheldon123z commented 2 years ago

https://my-bucket-1302790708.cos-website.ap-guangzhou.myqcloud.com/2022/04/26/22-2/

Active learning 主动学习 Active ADP active ADP的更新公式 $$ U(s)=\max{a \in A(s)} \sum{s^{\prime}} P\left(s^{\prime} \mid s, a\right)\left[R\left(s, a, s^{\prime}\right)+\gamma U\left(s^{\prime}\right)\right

sheldon123z commented 2 years ago

测试一下

sheldon123z commented 2 years ago

效果拔群。。。

liu-jinyu commented 2 years ago

23333333