I am trying the recurrent PPO examples with my own single environment. In any case I will get this size error.
File "ppo_lstm", line 66, in compute
rnn_input = states.view(-1, self.sequence_length, states.shape[-1]) # (N, L, Hin): N=batch_size, L=sequence_length
RuntimeError: shape '[-1, 128, 23]' is invalid for input of size 736
23 is the observation size * 32 mini-batch size = 736
I registered the environment to vectorize it, setting the num_workers to 1, but I am not 100% sure if it worked. Making gym.vector.make() at least didn't give an error:
Sorry, this might be a trivial question
I am trying the recurrent PPO examples with my own single environment. In any case I will get this size error.
23 is the observation size * 32 mini-batch size = 736
I registered the environment to vectorize it, setting the num_workers to 1, but I am not 100% sure if it worked. Making
gym.vector.make()
at least didn't give an error:I also tried without vectorizing, but it also didn`t work. SAC is working fine of course without vectorizing.
Basic information