Closed qas32 closed 8 years ago
When the input tensor is too big, number of blocks exceeds 65535, which causes crashes when compiling with arch=sm_20. That causes issues when trying to loadcaffe VGG_M network with 'cudnn', which uses inn.SCRN .
Any reason for not using sm_35 ?
you can just use cudnn. I will update loadcaffe in a few days to support R3.
fixed by https://github.com/szagoruyko/imagine-nn/pull/16
When the input tensor is too big, number of blocks exceeds 65535, which causes crashes when compiling with arch=sm_20. That causes issues when trying to loadcaffe VGG_M network with 'cudnn', which uses inn.SCRN .
Any reason for not using sm_35 ?