Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric
https://eclecticsheep.ai
Apache License 2.0

Feature/ppo recurrent refactoring #105

Closed: belerico closed this 9 months ago

belerico commented 9 months ago

Summary

In this PR we have unified the recurrent PPO implementation. In particular:

Type of Change

Please select the one relevant option below:

Checklist

Please confirm that the following tasks have been completed:

Thank you for your contribution! Once you have filled out this template, please ensure that you have assigned the appropriate reviewers and that all tests have passed.