Open edmund735 opened 7 months ago
Any updates on this?
@edmund735 thanks for filing this issue. We are currently moving to a new API stack, and `_enable_new_api_stack` is the key that signals whether the new or the old stack should be used. It has to be part of the configuration, which is usually of type `AlgorithmConfig` and not `dict` as in your example. You can take a look at our examples (e.g. https://github.com/ray-project/ray/blob/master/rllib/examples/_old_api_stack/complex_struct_space.py). The `AlgorithmConfig` takes care of all needed keys.
In addition, when using the new stack (ideally with the current Ray release), the `api_stack` method should be used to enable the new API stack (see here for an example: https://github.com/ray-project/ray/blob/cf7a09daa201c92d71cffa9f4a8e0afe45ab0475/rllib/utils/test_utils.py#L1370).
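As a rough illustration of the advice above, here is a minimal sketch of building the configuration as an `AlgorithmConfig` object instead of a plain `dict`. The helper name `build_ppo_config` and its parameters are hypothetical. The `api_stack` keyword arguments are the ones used in recent Ray releases and are an assumption here; they may not exist in older versions such as 2.9.x, so check them against the installed release.

```python
def build_ppo_config(lr: float = 5e-5, use_new_api_stack: bool = False):
    """Hypothetical helper: returns a PPOConfig rather than a raw dict."""
    # Imported lazily so this sketch can be loaded without Ray installed.
    from ray.rllib.algorithms.ppo import PPOConfig

    # The config object fills in all required keys itself.
    config = PPOConfig().environment("Pendulum-v1").training(lr=lr)

    if use_new_api_stack:
        # Assumption: argument names as found in recent Ray releases.
        config = config.api_stack(
            enable_rl_module_and_learner=True,
            enable_env_runner_and_connector_v2=True,
        )
    return config
```

The returned object would then be used wherever the hand-written `dict` was used before, so that the stack-related keys are set consistently.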
What happened + What you expected to happen
After implementing the `reset_config()` method for PPO and running PB2 with `reuse_actors=True` with Pendulum-v1, it gives this error:
If I set `reuse_actors = False` under the PB2 configs, this error doesn't show up, so I assume that maybe I don't have a correct implementation of `reset_config()`? I added it under the `class PPO(Algorithm):` under the ray library in my conda env:
Versions / Dependencies
Python 3.10.14, ray-rllib 2.9.3
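For readers unfamiliar with the `reset_config()` contract that `reuse_actors=True` relies on, the following is a minimal, Ray-free sketch. As far as I understand the Tune documentation, `reset_config(new_config)` should return `True` when the new configuration was applied in place and `False` when the actor cannot be reused. The class `ReusableTrainable` and the `MUTABLE_KEYS` set are hypothetical stand-ins, not RLlib's `PPO` class and not the implementation from this issue.

```python
from typing import Any, Dict


class ReusableTrainable:
    """Hypothetical stand-in for a Tune Trainable (not RLlib's PPO)."""

    # Assumption: only these hyperparameters can be swapped in place.
    MUTABLE_KEYS = {"lr", "clip_param"}

    def __init__(self, config: Dict[str, Any]):
        self.config = dict(config)
        self._apply_hyperparams(self.config)

    def _apply_hyperparams(self, config: Dict[str, Any]) -> None:
        # A real algorithm would also have to push these values to its
        # workers and optimizers; this sketch only stores them locally.
        self.lr = config["lr"]
        self.clip_param = config["clip_param"]

    def reset_config(self, new_config: Dict[str, Any]) -> bool:
        # Find every key whose value differs between old and new config.
        all_keys = set(self.config) | set(new_config)
        changed = {
            k for k in all_keys if self.config.get(k) != new_config.get(k)
        }
        # Anything outside the mutable set (e.g. the env) needs a fresh
        # actor, so report failure and leave the current state untouched.
        if not changed <= self.MUTABLE_KEYS:
            return False
        self.config = dict(new_config)
        self._apply_hyperparams(self.config)
        return True
```

A short usage example under the same assumptions:

```python
t = ReusableTrainable({"env": "Pendulum-v1", "lr": 1e-3, "clip_param": 0.2})
ok = t.reset_config({"env": "Pendulum-v1", "lr": 5e-4, "clip_param": 0.3})
print(ok, t.lr)
```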
Reproduction script
The PPO class:
script:
Issue Severity
High: It blocks me from completing my task.