[VDCNN] 질문 - Githubissues

VDCNN에서는 3가지 방식의 down sampling 방식을 실험합니다.

(i) The first convolutional layer of Ki+1 has stride 2 (ResNet-like). (ii) Ki is followed by a k-max pooling layer where k is such that the resolution is halved (iii) Ki is followed by max-pooling with kernel size 3 and stride 2 (VGG-like).

첫번째 방법은 stride가 2인 cnn을 사용하여 길이를 반으로 줄입니다. 이는 1D CNN을 이해하면 쉬우실 겁니다. kernel size가 3이고 padding이 2라면, stride가 2일 때 문장 길이에 해당하는 차원이 절반이 됩니다. 문장 길이가 20이었다면 padding 후에 22, 1d cnn 후에는 11이 됩니다.