Open jbloomAus opened 1 year ago
This turns out to be hard because we get underflow with the prob environments on the RNN and need to edit the trajectory LSTM model which expects to get mini-grid environment frames. I will need to think more about how to test it with probe environments.
We currently have 5 probe environments for single timestep models and I'd like a prob environment to test if a model can learn: