Closed helpingstar closed 10 months ago
This is a parameter introduced in TD3, an extension of DDPG (see http://proceedings.mlr.press/v80/fujimoto18a/fujimoto18a.pdf). Reading the CleanRL docs, its clear the DDPG implementation was heavily inspired by the TD3 authors implementation of TD3 and DDPG. My guess is this parameter got accidentally added to the parser because its required for TD3 and can safely be removed from this DDPG implementation.
@glass1720 Thank you
https://github.com/vwxyzjn/cleanrl/blob/7e24ae238eab6a8e7efbbf452cb4a8922bcda73f/cleanrl/ddpg_continuous_action.py#L65-L66
It doesn't appear to be used in the code and there doesn't appear to be any mention of it in the documentation.
How should I use this parameter?
https://github.com/vwxyzjn/cleanrl/blob/7e24ae238eab6a8e7efbbf452cb4a8922bcda73f/cleanrl/ddpg_continuous_action.py#L196
Can I just add
torch.clip
(ortorch.clamp
) to the above code as shown below?