Closed Nick-Kou closed 3 years ago
Hi,
"last_action is not defined"
Thanks for pointing out. In test, adding last_action = env.action_space.sample()
should work, just as in training.
NormalizedActions
I think the normalisation is to make the standard policy output from [-1, 1] to [action_space.low, action_space.high].
Hi,
After training sac_lstm by running
python3 sac_v2_lstm.py --train
, I then attempted to test. Upon testing, I received an error stating name "last_action is not defined". More specifically:For now I added
last_action = env.action_space.sample()
in line 302 after the else statement, however, I am unsure if this is correct.Lastly, I just had a question with regards to the "NormalizedActions" class in line 50 sac_v2_lstm.py. How will this be useful if the action space is for example [-1,+1] but normalisation scales to [0,1], is there any reference text for this motivation?
Thanks