JinghaoLu / MIN1PIPE

A MINiscope 1-photon-based Calcium Imaging Signal Extraction PIPEline.
GNU General Public License v3.0

GPU memory usage optimisation #31

Open plodocus opened 4 years ago

plodocus commented 4 years ago

Hi Jinghao,

I did some GPU memory profiling on https://github.com/JinghaoLu/MIN1PIPE/blob/32084136b3d67cb77f94c1cd3e26d85dc9712f0a/utilities/movement_correction/inter_section.m#L89. If `tmp` is a 480-by-752-by-5 double (around 14 MB), I get a peak GPU memory usage of 623 MB, i.e. about 45 times the size of a single matrix. I tested with smaller matrices as well, and it seems MATLAB has a fixed GPU overhead of about 200 MB; beyond that, every 1 MB in `tmp` increases maximum GPU memory usage by about 25 MB. This was all tested without a parallel pool.
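For reference, the kind of measurement described above can be approximated by sampling `gpuDevice().AvailableMemory` around the call under test. This is only a sketch of one plausible approach (the matrix size is the one from the issue, the workload is a placeholder, and `AvailableMemory` reports instantaneous rather than true peak usage, so a real peak measurement would need polling):

```matlab
% Hedged sketch: estimate GPU memory consumed by a step under test.
% The workload line is a placeholder, not MIN1PIPE code.
g    = gpuDevice;
base = g.AvailableMemory;                 % free bytes before the step
tmp  = gpuArray(rand(480, 752, 5));       % ~14 MB of doubles on the GPU
% ... run the registration step under test here ...
wait(g);                                  % make sure all kernels finished
used = base - g.AvailableMemory;          % bytes consumed at this point
fprintf('GPU memory in use: %.0f MB\n', used / 2^20);
```

Note that this only captures memory still allocated at the sample point; transient allocations inside library calls can push the true peak higher.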

Is there some way to improve GPU memory usage? The increased GPU memory load really limits the number of parallel workers.

For example, how much of the image padding is actually needed? I didn't go through all the functions called by lk_logdemons_unit(), but gradient_fast(), for example, only seems to need a zero-padding margin of one element on each side of the image.
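To illustrate the point about padding: a centered-difference gradient only reads one pixel beyond each border, so in principle a one-element zero pad would suffice. A minimal sketch (using the built-in `gradient` and `padarray` from the Image Processing Toolbox as stand-ins for gradient_fast(); not the actual MIN1PIPE code):

```matlab
% Hedged sketch: a one-pixel zero margin is enough for a
% centered-difference gradient; larger pads only cost memory.
img    = rand(480, 752, 'gpuArray');
padded = padarray(img, [1 1], 0, 'both');  % 482-by-754, one-pixel margin
[gx, gy] = gradient(padded);               % stand-in for gradient_fast()
gx = gx(2:end-1, 2:end-1);                 % crop back to the original size
gy = gy(2:end-1, 2:end-1);
```

If other callees in lk_logdemons_unit() genuinely need a larger margin, the pad size could at least be set per-callee rather than globally.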

Best wishes, Daniel

JinghaoLu commented 4 years ago

This is tricky. The core GPU-using parts are logdemons_unit and the logdemons function it calls.

logdemons_unit runs an iteration loop over logdemons, and every iteration saves the gpuArray "image_output" together with the x-dimension and y-dimension deformation matrices. That is 3 × num_iter times the input frame size. I do not think this can be reduced further unless a better algorithm appears, so the only thing that can potentially be improved is logdemons. Maybe profile it first and then we can take a look.
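The 3 × num_iter estimate above can be made concrete with a back-of-envelope calculation. A sketch, using the frame size from the issue and an illustrative iteration count (num_iter here is an assumption, not a value taken from the code):

```matlab
% Hedged estimate: GPU bytes retained if image_output plus the x/y
% deformation fields are kept for every iteration. Illustrative numbers.
frameBytes = 480 * 752 * 8;        % one double-precision frame
numIter    = 20;                   % hypothetical iteration count
estBytes   = 3 * numIter * frameBytes;
fprintf('~%.0f MB retained across iterations\n', estBytes / 2^20);
```

If only the latest iterate is actually needed downstream, overwriting the three arrays in place each iteration (rather than accumulating them) would drop this term to a constant 3 × frame size, which may be worth checking during profiling.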