Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric
https://eclecticsheep.ai
Apache License 2.0
305 stars 31 forks source link

Fix bootstrap on termination #106

Closed belerico closed 1 year ago

belerico commented 1 year ago

Summary

This PR fixes the truncation bootstrapping, where we were wrongly saving the real next observations.

Type of Change

Please select the one relevant option below:

Please confirm that the following tasks have been completed:

Thank you for your contribution! Once you have filled out this template, please ensure that you have assigned the appropriate reviewers and that all tests have passed.