How does the input dimension change in resBlock as padding is always "SAME"?

yfeng95 / PRNet

Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)

MIT License

4.96k stars 944 forks source link

` 
def resBlock(x, num_outputs, kernel_size = 4, stride=1, activation_fn=tf.nn.relu, 
normalizer_fn=tcl.batch_norm, scope=None):

    assert num_outputs%2==0 #num_outputs must be divided by channel_factor(2 here)
    with tf.variable_scope(scope, 'resBlock'):
        shortcut = x
        if stride != 1 or x.get_shape()[3] != num_outputs:
            shortcut = tcl.conv2d(shortcut, num_outputs, kernel_size=1, stride=stride, 
                        activation_fn=None, normalizer_fn=None, scope='shortcut')
        x = tcl.conv2d(x, num_outputs/2, kernel_size=1, stride=1, padding='SAME')
        x = tcl.conv2d(x, num_outputs/2, kernel_size=kernel_size, stride=stride, padding='SAME')
        x = tcl.conv2d(x, num_outputs, kernel_size=1, stride=1, activation_fn=None, padding='SAME', normalizer_fn=None)
        x += shortcut       
        x = normalizer_fn(x)
        x = activation_fn(x)
    return x

In the resBlock above, all the three convolution layers have padding as SAME. Then how does the input image height and width decrease during the encoding part?

yfeng95 / PRNet

How does the input dimension change in resBlock as padding is always "SAME"? #62