Thank you for your question.
The choice of using `nn.Conv2d` over `nn.Conv1d` in our embedding layers comes from considerations outlined in Section 4.3 of our paper.
In summary:
- **Accuracy with efficiency:** By breaking the 1D convolution kernel into separate temporal and spatial components, we gain accuracy without substantial computational overhead. Similar to the Inverted Bottleneck concept, the embedding expands the input channels via the temporal convolution and then projects the result back with the spatial convolution (see the sketch below).
- **Inspired by the FFN in Transformers:** Our strategy of expanding and then projecting the hidden states is also inspired by the Feed Forward Network (FFN) in Transformers and is designed to capture spatial interactions effectively.
- **Channel interactions:** This type of convolution excels at capturing channel interactions in multivariate time series, as highlighted in the Disjoint-CNN paper.
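For illustration, here is a minimal PyTorch sketch of the disjoint temporal-then-spatial embedding described above. The module name, kernel size, and 4x expansion factor are assumptions for the example, not necessarily the exact values used in ConvTran:

```python
import torch
import torch.nn as nn

# Illustrative sketch only (names, kernel sizes and expansion factor are
# assumptions): a disjoint embedding that treats a multivariate series as a
# [batch, 1, channels, seq_len] "image" and convolves it with nn.Conv2d.
class DisjointEmbedding(nn.Module):
    def __init__(self, in_channels: int, emb_size: int, kernel_size: int = 7):
        super().__init__()
        # Temporal convolution: a (1, k) kernel slides along the time axis only
        # and expands the feature dimension (Inverted Bottleneck-style expansion).
        self.temporal = nn.Sequential(
            nn.Conv2d(1, emb_size * 4, kernel_size=(1, kernel_size),
                      padding=(0, kernel_size // 2)),
            nn.BatchNorm2d(emb_size * 4),
            nn.GELU(),
        )
        # Spatial convolution: a (channels, 1) kernel mixes across the input
        # variables and projects the expanded features back down to emb_size.
        self.spatial = nn.Sequential(
            nn.Conv2d(emb_size * 4, emb_size, kernel_size=(in_channels, 1)),
            nn.BatchNorm2d(emb_size),
            nn.GELU(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: [batch, 1, in_channels, seq_len]
        x = self.temporal(x)   # -> [batch, emb_size * 4, in_channels, seq_len]
        x = self.spatial(x)    # -> [batch, emb_size, 1, seq_len]
        return x.squeeze(2).transpose(1, 2)  # -> [batch, seq_len, emb_size]

# Example: 9 variables, 128 time steps, 16-dimensional embedding.
emb = DisjointEmbedding(in_channels=9, emb_size=16)
print(emb(torch.randn(32, 1, 9, 128)).shape)  # torch.Size([32, 128, 16])
```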
Hi, I just have one question.
I am just curious why you're using `nn.Conv2d` instead of `nn.Conv1d` in the embedding layers? i.e., here: https://github.com/Navidfoumani/ConvTran/blob/main/Models/model.py#L97. If you used `nn.Conv1d`, you wouldn't need the awkward and computationally wasteful unsqueezing and squeezing logic around here: https://github.com/Navidfoumani/ConvTran/blob/main/Models/model.py#L134
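For context, a minimal sketch of the reshaping pattern the question refers to, assuming a `[batch, channels, seq_len]` input; the channel counts and kernel size below are made up for illustration and are not the repo's actual values:

```python
import torch
import torch.nn as nn

# Hypothetical shapes: 32 samples, 9 variables, 128 time steps.
x = torch.randn(32, 9, 128)

# Conv2d route: add a singleton dim, convolve, then squeeze it back out.
conv2d = nn.Conv2d(1, 16, kernel_size=(9, 7), padding=(0, 3))
y2d = conv2d(x.unsqueeze(1)).squeeze(2)   # [32, 16, 128]

# Conv1d route: the variable axis is the input-channel axis, no reshaping needed.
conv1d = nn.Conv1d(9, 16, kernel_size=7, padding=3)
y1d = conv1d(x)                           # [32, 16, 128]
```

When the 2D kernel spans the full channel axis like this, the two routes compute the same convolution up to a reshape of the weight tensor, which is the equivalence the question is pointing at.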