Open Summer142857 opened 3 years ago
Hi! No, the implementation doesn't explicitly cover continuous action spaces. You can apply RUDDER to continuous action spaces by changing the input to the reward redistribution LSTM: instead of feeding discrete actions as input features, feed the continuous action values as input features.
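A minimal sketch of what that change could look like, assuming the LSTM input at each timestep is the observation concatenated with the action. The function names `discrete_features` and `continuous_features` are hypothetical and not taken from the RUDDER code; they only illustrate swapping a one-hot action encoding for the raw continuous action vector.

```python
import numpy as np


def discrete_features(observations, actions, n_actions):
    """Hypothetical helper: per-timestep features [observation, one-hot(action)]."""
    observations = np.asarray(observations, dtype=np.float32)  # (T, obs_dim)
    actions = np.asarray(actions, dtype=np.int64)               # (T,)
    one_hot = np.eye(n_actions, dtype=np.float32)[actions]      # (T, n_actions)
    return np.concatenate([observations, one_hot], axis=-1)


def continuous_features(observations, actions):
    """Hypothetical helper: per-timestep features [observation, continuous action]."""
    observations = np.asarray(observations, dtype=np.float32)  # (T, obs_dim)
    actions = np.asarray(actions, dtype=np.float32)             # (T,) or (T, action_dim)
    if actions.ndim == 1:
        # Treat a 1-D action sequence as a single action dimension.
        actions = actions[:, None]
    return np.concatenate([observations, actions], axis=-1)


# Example episode: 5 timesteps, 3-dimensional observations, 2-dimensional actions.
obs = np.zeros((5, 3))
acts = np.linspace(-1.0, 1.0, 10).reshape(5, 2)
lstm_inputs = continuous_features(obs, acts)  # shape (5, 3 + 2)
```

With this layout the LSTM's input size becomes `obs_dim + action_dim` rather than `obs_dim + n_actions`. Depending on the environment, it may also help to scale the actions to a range comparable to the observations.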
It seems that the code does not cover the case of continuous action spaces.