Eclectic-Sheep / sheeprl

Distributed Reinforcement Learning accelerated by Lightning Fabric
https://eclecticsheep.ai
Apache License 2.0

Dreamer-V2 #43

Closed belerico closed 1 year ago

belerico commented 1 year ago

Summary

This PR introduces the Dreamer-V2 algorithm from https://arxiv.org/abs/2010.02193

Additional Information (Optional)

These are the training losses and rewards on DMC Walker-Walk with the standard hyperparameters found in dreamer_v2/args.py

[Two images: training loss and reward plots]

belerico commented 1 year ago

Maybe it's better to account for the action repeat as we have done in Dreamer-V1 @michele-milesi
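
For illustration only, and not taken from the sheeprl code: one common way to account for action repeat is to express budgets in environment frames and convert them to policy steps, since each policy action is applied for `action_repeat` consecutive frames. The helper name `policy_steps` and the numbers below are hypothetical, and this may not match what was done for Dreamer-V1.

```python
def policy_steps(env_frames: int, action_repeat: int) -> int:
    """Convert a budget given in environment frames into policy steps.

    Hypothetical helper: with action repeat, one policy step consumes
    `action_repeat` environment frames, so the frame budget is divided by it.
    """
    if action_repeat < 1:
        raise ValueError("action_repeat must be >= 1")
    return env_frames // action_repeat


# Assumed example values, not the defaults from dreamer_v2/args.py.
total_frames = 1_000_000
action_repeat = 2
print(policy_steps(total_frames, action_repeat))
```

The same conversion would apply to any other frame-based setting (for example, a warm-up period before learning starts), so that the schedule stays comparable when the action repeat changes.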