Open andreaskoepf opened 2 years ago
Baseline run of old version: Tensorboard events file, charts (screenshot).
I reverted the main branch (--hard & force push) back to the Apr 7th working state. The refactoring commits have been moved to xmaster_refactor branch. I would suggest to treat xmaster_refactor
as a temporary branch that is sealed and to add changes in a clean way (in multiple steps) back to the main branch.
General notes:
Working commit: https://github.com/world-modelz/dreamax/commit/3e0ac35f7e44946a26430ca489e53ed415c84aa3
Pendulum after ~30min training at 100k env steps and >>500 average return.