ResourceExhaustedError: OOM when allocating tensor

a2018c commented 6 years ago

Hi,

When trying "--growth_rate=12 --depth=100 --dataset=C100", it returned "ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[64,372,32,32] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc"

From the GPU usage, I found that it only used GPU[0] and hit OOM:

How to resolve it? Regards

a2018c commented 6 years ago

(For other people if get same issue: my workaround is to set batch size to 8)

ikhlestov commented 6 years ago

Hi! You are definitely right. You've got OOM error, because network cannot fit provided memory. To reduce network size you may decrease some parts of it - batch size, growth rate, number of layers, etc.

ikhlestov / vision_networks

ResourceExhaustedError: OOM when allocating tensor #25