Open half-potato opened 1 year ago
I would suspect a memory issue. In the second pass, we switch to learning 2D textures, so the memory requirement goes up a bit. If you are running near the memory limit, perhaps try decreasing the texture resolution a bit, e.g., using the config flag "texture_res": [ 512, 512 ], or, if you are running on a GPU with <32GB of memory, perhaps also reduce the batch size.
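As a rough illustration of the suggestion above, here is a minimal Python sketch that lowers the memory-related settings in a config dictionary. Only "texture_res" comes from this thread. The "batch" key name, the helper reduce_memory_config, and the starting values are assumptions made for the example, so check them against your own config file.

```python
import json


def reduce_memory_config(config, texture_res=(512, 512), batch=None):
    """Return a copy of a config dict with lower-memory settings.

    "texture_res" is the flag mentioned in this thread; "batch" is an
    assumed key name for the batch size and may differ in your config.
    """
    reduced = dict(config)  # shallow copy so the input dict is not modified
    reduced["texture_res"] = list(texture_res)
    if batch is not None:
        reduced["batch"] = batch
    return reduced


# Illustrative starting values only, not taken from any real config.
original = {"texture_res": [2048, 2048], "batch": 8}
print(json.dumps(reduce_memory_config(original, batch=6)))
```

The helper returns a new dictionary rather than editing the input, so the original settings stay available for comparison when benchmarking.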
You can track memory usage with nvidia-smi or nvitop: https://github.com/XuehaiPan/nvitop
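If you want to log memory from a script rather than watch it interactively, something like the following sketch may help. It assumes nvidia-smi is on the PATH and uses its CSV query output. The function names parse_memory_csv and query_gpu_memory are made up for this example.

```python
import subprocess


def parse_memory_csv(text):
    """Parse lines of 'used, total' (MiB) into a list of (used, total) ints."""
    rows = []
    for line in text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        rows.append((used, total))
    return rows


def query_gpu_memory():
    """Ask nvidia-smi for per-GPU memory usage; requires an NVIDIA driver."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_csv(result.stdout)
```

Calling query_gpu_memory() periodically during training (for example once per pass) gives one (used, total) pair per GPU, which should make it easier to see whether usage climbs toward the limit when the second pass starts.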
I agree. I just realized nvdiffrecmc works with batch size = 6 without crashing. This is for the purpose of benchmarking, so I hope this doesn't decrease accuracy too much.
Not sure what kind of information you need to debug this.