harukaki / brl

reinforcement learning for bridge
Apache License 2.0
8 stars 1 forks source link

Add reward_scaling #7

Closed harukaki closed 1 year ago

harukaki commented 1 year ago

ppo code optimaizationの一つであるreward_scalingを導入