Open sheldon123z opened 2 years ago
https://my-bucket-1302790708.cos-website.ap-guangzhou.myqcloud.com/2022/04/26/22-2/
Active learning 主动学习 Active ADP active ADP的更新公式 $$ U(s)=\max{a \in A(s)} \sum{s^{\prime}} P\left(s^{\prime} \mid s, a\right)\left[R\left(s, a, s^{\prime}\right)+\gamma U\left(s^{\prime}\right)\right
测试一下
效果拔群。。。
23333333
https://my-bucket-1302790708.cos-website.ap-guangzhou.myqcloud.com/2022/04/26/22-2/
Active learning 主动学习 Active ADP active ADP的更新公式 $$ U(s)=\max{a \in A(s)} \sum{s^{\prime}} P\left(s^{\prime} \mid s, a\right)\left[R\left(s, a, s^{\prime}\right)+\gamma U\left(s^{\prime}\right)\right