kakaoenterprise / JORLDY

Repository for Open Source Reinforcement Learning Framework JORLDY
Apache License 2.0
359 stars 50 forks source link

[-] Fix mpo type 1step_TD #166

Closed kan-s0 closed 2 years ago

kan-s0 commented 2 years ago

:star2: Hello! Thanks for contributing JORLDY!

Checklist

Please check if you consider the following items.

Types of changes

Please describe the types of changes! (ex. Bugfix, New feature, Documentation, ...) bugfix

Test Configuration

Description

  1. if mpo step type is 1step_TD, tmp_buffer size set self.n_step(1step_TD).
  2. if mpo config step type is 1step_TD, n_step is also unified as 1.