Open AvisP opened 1 year ago
I realized this issue was happening because I was providing discrete actions ( change_line_status
, change_bus
) to the train
function of PPO_RLLIB
. Inside this function the conversion of action space of grid2ops to gym env is happening using BoxGymActSpace
which handles continuous actions. But for discrete actions it needs to have MultiDiscreteActSpace
or DiscreteActSpace
. Although I am not sure what should be the solution when there is a mixture of discrete and continuous actions.
System information
1.8.1
0.6.0.post1
mac osx, ubuntu16.04, ...
PPO_RLLIB
Bug description
When I am evaluating trained PPO_RLLIB agent the total score for chronics is getting printed out as 0. Even if the PPO_RLLIB agent didn't get trained properly, but total_score should still be non-zero.
Output I am getting is
How to reproduce
The training script I used
The evaluation script used