Proposal: do something about redundantly large observations

The observations are large images due to spriteSize=8. However, most of this information is redundant. In the rllib example, a convolutional layer is applied with stride 8 to leave only the first pixel out of 8x8=64 pixels, meaning that observations are x64 bigger than needed. This might be fine for on-policy methods, but for methods that use replay buffers, this transforms several gigabytes of memory into several hundred gigabytes. I had to change spriteSize=8 to spriteSize=1 in the config files of specific environments which is fine but not the most elegant (or intended) solution.

As far as I understand, spriteSize=8 is only useful for rendering. So a good solution to this issue could be to use spriteSize=1 everywhere but transform small images into bigger images as a subroutine during rendering. A worse but maybe easier solution would be to allow users to change spriteSize in the config (and also explain this parameter in the examples).

(this is based on the state of the repo a couple of weeks ago, I don't know if anything changed)

google-deepmind / meltingpot

Proposal: do something about redundantly large observations #152