CosmiQ / solaris

CosmiQ Works Geospatial Machine Learning Analysis Toolkit
https://solaris.readthedocs.io
Apache License 2.0
414 stars 112 forks

CUDA out of memory for large image inference #361

Open avanetten opened 4 years ago

avanetten commented 4 years ago

Summary of the bug

When running inference on images larger than ~1200x1200, CUDA often runs out of memory. This looks to be because the tiler puts all subwindows of a large image into a single batch (https://github.com/CosmiQ/solaris/blob/master/solaris/nets/infer.py#L75). This large batch can then be too large to fit into memory.
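Until the tiler is changed upstream, one workaround is to split the stacked subwindows into smaller chunks before the forward pass so no single batch exceeds GPU memory. A minimal sketch (the `model`, `infer_in_chunks` helper, and `max_batch` value are illustrative placeholders, not part of the solaris API):

```python
import numpy as np

def infer_in_chunks(model, windows, max_batch=8):
    """Run `model` over `windows` in chunks of at most `max_batch`,
    instead of one giant batch, then stitch the outputs back together."""
    outputs = []
    for start in range(0, len(windows), max_batch):
        chunk = windows[start:start + max_batch]
        outputs.append(model(chunk))  # each forward pass sees <= max_batch windows
    return np.concatenate(outputs, axis=0)

# Dummy example: 37 fake 3x16x16 subwindows and an identity "model"
windows = np.zeros((37, 3, 16, 16), dtype=np.float32)
result = infer_in_chunks(lambda x: x, windows, max_batch=8)
print(result.shape)  # (37, 3, 16, 16)
```

With a real PyTorch model the same loop applies; wrapping the forward pass in `torch.no_grad()` further reduces memory since no gradients are needed at inference time.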

Steps to reproduce the bug

# In this case eg.yml points to images of size 2048x2048
import solaris as sol
config_path = 'eg.yml'
config = sol.utils.config.parse(config_path)
print('Config:')
print(config)
inferer = sol.nets.infer.Inferer(config)
inferer()

Buggy behavior and/or error message

RuntimeError: CUDA out of memory. Tried to allocate 2.00 GiB (GPU 0; 11.93 GiB total capacity; 8.75 GiB already allocated; 763.06 MiB free; 10.74 GiB reserved in total by PyTorch)

Expected behavior

Inference should run smoothly on large images

Servando1990 commented 4 years ago

Hi!

I'm getting the same error: RuntimeError: CUDA out of memory. Tried to allocate 1.56 GiB (GPU 0; 14.73 GiB total capacity; 12.90 GiB already allocated; 947.88 MiB free; 12.92 GiB reserved in total by PyTorch)

I wonder whether inference and training work independently of each other, because otherwise I won't bother starting training, since I'd hit the same error there.

Could you share your yml file?