StevenMaio / rl-repair

Repository for python implementation of thesis work
1 stars 0 forks source link

[feat] implemented serial policy gradient #19

Closed StevenMaio closed 1 year ago

StevenMaio commented 1 year ago
  1. implemented serial version of policy gradient estimator