facebookresearch / SparseConvNet

Submanifold sparse convolutional networks
https://github.com/facebookresearch/SparseConvNet

Dilated convolution #239

Closed rdbs-oss closed 1 year ago

rdbs-oss commented 1 year ago

Hello,

Thank you for open-sourcing and maintaining this repository.

I was wondering whether there is a way (or a trick) to run dilated sparse convolutions.
I would like to train a sparse convolution layer with a very large receptive field, but I cannot afford many parameters, so dilated convolution seemed like a good fit (e.g. a 3x3 kernel with dilation 4 spans a 9x9 window with only 9 weights instead of 81).

What I have tried so far is feeding a sparse tensor as SubmanifoldConvolution's weight:

import sparseconvnet as scn
from torch.nn import Parameter

conv_layer = scn.SubmanifoldConvolution(3, 16, 32, 3, False)  # dimension, nIn, nOut, filter_size, bias (illustrative values)
conv_layer.weight = Parameter(conv_layer.weight.data.to_sparse())  # swap the dense weight for a sparse COO tensor

This fails at forward time with: RuntimeError: sparse tensors do not have strides

The only workaround I see is dirty: declare a SubmanifoldConvolution with a big kernel and zero out part of its gradients at backward time through a hook (as in the sketch below), but that is computationally inefficient, since many gradients are computed only to be discarded.
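
For concreteness, here is a minimal, untested sketch of that masking trick. It assumes the layer stores its weight with the flattened filter volume as the first dimension (as SubmanifoldConvolution does); the 5x5 kernel, dilation 2, and the helper name make_dilated_2d are all illustrative:

import torch

def make_dilated_2d(conv_layer, filter_size=5, dilation=2):
    # emulate a 3x3 kernel with dilation 2 inside a dense 5x5 kernel
    taps = torch.arange(0, filter_size, dilation)            # active rows/cols: 0, 2, 4
    active = torch.zeros(filter_size, filter_size, dtype=torch.bool)
    active[taps.unsqueeze(1), taps] = True                   # cross product of active rows/cols
    mask = active.flatten()                                  # one flag per flattened kernel offset
    shape = (-1,) + (1,) * (conv_layer.weight.dim() - 1)     # broadcast over the channel dims
    with torch.no_grad():
        conv_layer.weight[~mask] = 0                         # zero the inactive taps once
    # keep their gradients at zero so the optimizer never updates them
    conv_layer.weight.register_hook(lambda g: g * mask.view(shape).to(g))
    return conv_layer

Note that the full gradient is still computed before the hook zeroes it, which is exactly the inefficiency mentioned above.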

Thank you for your help,

btgraham commented 1 year ago

In the sparse setting, I don't think dilated convolutions make sense: the overlap of a dilated kernel with the active locations is likely to be small. Instead, you should downsample using size-2, stride-2 convolutions, e.g. https://github.com/facebookresearch/SparseConvNet/blob/6919d707b6c1deb94964219c8943154b65517756/sparseconvnet/networkArchitectures.py#L132 or https://github.com/facebookresearch/SparseConvNet/blob/6919d707b6c1deb94964219c8943154b65517756/sparseconvnet/networkArchitectures.py#L203
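
As an illustration of that suggestion (not from the thread; dimension=3 and the channel widths are assumed), interleaving submanifold convolutions with size-2, stride-2 convolutions grows the receptive field geometrically without any dilation:

import sparseconvnet as scn

net = scn.Sequential(
    scn.SubmanifoldConvolution(3, 16, 16, 3, False),  # dimension, nIn, nOut, filter_size, bias
    scn.Convolution(3, 16, 32, 2, 2, False),          # dimension, nIn, nOut, size=2, stride=2, bias
    scn.SubmanifoldConvolution(3, 32, 32, 3, False),
    scn.Convolution(3, 32, 64, 2, 2, False),
    scn.SubmanifoldConvolution(3, 64, 64, 3, False),
)

Each size-2, stride-2 scn.Convolution halves the spatial resolution, so the final 3x3 submanifold convolution, sitting after two downsamplings, already covers a 12-voxel-wide window of the original input.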