Non-episodic update of Multistep agent

kakaoenterprise / JORLDY

Repository for Open Source Reinforcement Learning Framework JORLDY

Apache License 2.0

362 stars 49 forks

Describe the bug

Samples collected by the Multistep agent contain garbage values for states after the terminal state.
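For illustration, one plausible reading of this report is that an n-step window keeps accumulating transitions across an episode boundary, so a sample ends up mixing rewards or states from after the terminal step. The sketch below shows one way to cut the window at the terminal transition. It is a minimal sketch under that assumption, not JORLDY's actual code: the function name `n_step_samples`, the tuple layout `(state, action, reward, next_state, done)`, and the parameters `n_step` and `gamma` are all hypothetical.

```python
from collections import deque


def n_step_samples(transitions, n_step, gamma):
    """Build n-step samples from (state, action, reward, next_state, done) tuples.

    Hypothetical helper, not JORLDY's implementation. The window is flushed at
    every terminal transition, so nothing from after the end of an episode is
    folded into a sample that started before it.
    """
    window = deque()
    samples = []

    def emit():
        # Sample starts at the oldest transition in the window.
        state, action = window[0][0], window[0][1]
        ret, next_state, done = 0.0, window[0][3], window[0][4]
        for k, (_, _, reward, nxt, terminal) in enumerate(window):
            ret += (gamma ** k) * reward
            next_state, done = nxt, terminal
            if terminal:
                break  # never look past the terminal transition
        samples.append((state, action, ret, next_state, done))
        window.popleft()

    for transition in transitions:
        window.append(transition)
        if transition[4]:
            # Episode ended: emit every remaining (shorter) window, then
            # start the next episode with an empty window.
            while window:
                emit()
        elif len(window) == n_step:
            emit()
    return samples
```

With this layout, samples that start near the end of an episode use a shorter return and carry `done=True`, and the first transition of the next episode always starts a fresh window.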

