NVlabs / pacnet

Pixel-Adaptive Convolutional Neural Networks (CVPR '19)
https://suhangpro.github.io/pac/
Other
514 stars 79 forks source link

some basic questions #16

Closed CalvinChuan closed 4 years ago

CalvinChuan commented 4 years ago

Hi ,I’m really interested in this work.But as a newcomer in this field ,I got some basic questions after I read the code.

1:Why the channels of the guidance should be half of the input in the examples of the PacConv2d? 2:Are there any rules or the relations of the shape of feature(guide),K(Gussain kernel in this papre), weight(convolution kernel)?

I’m sorry to bother you,I would appreciate it if you can answer me these questions or just help me get a better understanding of this great work. Thank you so much.

suhangpro commented 4 years ago

Hi @CalvinChuan , for a layer with input size B x C x H x W, the guidance input should be of size B x C_g x H x W, i.e. the batch size, height, width should be the same, but the depth can different. Using half C as C_g is simply a design choice that works well for our examples.

suhangpro commented 4 years ago

If you want to supply a pre-computed K to the layer (which can be efficient if it's reused more than once), it should be of size B x 1 x F x F x H x W, where F is the convolution filter size.

CalvinChuan commented 4 years ago

If you want to supply a pre-computed K to the layer (which can be efficient if it's reused more than once), it should be of size B x 1 x F x F x H x W, where F is the convolution filter size.

Thankyou so much!!