Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric
https://eclecticsheep.ai
Apache License 2.0

Feature/buffer np #169

Closed belerico closed 9 months ago

belerico commented 9 months ago

Summary

This PR introduces new buffers that handle data as standard dictionaries of NumPy arrays instead of TensorDicts. The change keeps algorithm implementation and interaction as simple as possible, while also improving speed.
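
To illustrate the general idea of a buffer backed by a dictionary of NumPy arrays, here is a minimal sketch. It is not the code from this PR: the class name `DictRingBuffer`, its methods, and the keys `observations` and `rewards` are hypothetical, chosen only for the example. It assumes a fixed-capacity ring buffer that overwrites the oldest entries and samples uniformly at random.

```python
from typing import Dict

import numpy as np


class DictRingBuffer:
    """Hypothetical fixed-size buffer storing one NumPy array per key.

    Illustration only; this is NOT the sheeprl buffer API.
    """

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.storage: Dict[str, np.ndarray] = {}
        self.pos = 0        # index of the next write
        self.full = False   # True once the buffer has wrapped around

    def __len__(self) -> int:
        return self.capacity if self.full else self.pos

    def add(self, step: Dict[str, np.ndarray]) -> None:
        for key, value in step.items():
            value = np.asarray(value)
            if key not in self.storage:
                # Allocate lazily, once the shape and dtype of this key are known.
                self.storage[key] = np.zeros(
                    (self.capacity, *value.shape), dtype=value.dtype
                )
            self.storage[key][self.pos] = value
        self.pos = (self.pos + 1) % self.capacity
        self.full = self.full or self.pos == 0

    def sample(self, batch_size: int, rng: np.random.Generator) -> Dict[str, np.ndarray]:
        if len(self) == 0:
            raise ValueError("cannot sample from an empty buffer")
        # Uniform sampling with replacement over the filled part of the buffer.
        idx = rng.integers(0, len(self), size=batch_size)
        return {key: arr[idx] for key, arr in self.storage.items()}


buffer = DictRingBuffer(capacity=4)
for t in range(6):
    buffer.add({
        "observations": np.full(3, t, dtype=np.float32),
        "rewards": np.float32(t),
    })
batch = buffer.sample(batch_size=2, rng=np.random.default_rng(0))
```

Because every entry is a plain `dict` of arrays, a sampled batch can be indexed, sliced, or converted to tensors with ordinary NumPy and Python operations, without a dedicated container type.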

