Closed LX-Richard closed 4 years ago
Hi. Thanks for your question. The reason is that the transpose of the convolution operation itself involves flipping the filter along all spatial dimensions. This is most easily seen for a simple scalar 1-D convolution by writing out the corresponding convolution matrix explicitly. This flipping is not performed by the `F.conv_transpose2d` function, and therefore needs to be done explicitly. Note that we have several different implementations of the filter transpose (including ones that do not use `F.conv_transpose2d`). All have been tested against torch autograd and produce the correct output up to numerical precision.
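To make the 1-D argument concrete, here is a small NumPy sketch (illustrative only, not code from this repository): it builds the matrix of a `valid` cross-correlation explicitly and shows that applying the transposed matrix is the same as a full correlation with the *flipped* filter (i.e., a true convolution):

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal(3)   # 1-D filter [w0, w1, w2]
x = rng.standard_normal(5)   # 1-D signal

# Explicit matrix of the 'valid' cross-correlation y[m] = sum_i w[i] x[m+i]
A = np.array([
    [w[0], w[1], w[2], 0.0,  0.0 ],
    [0.0,  w[0], w[1], w[2], 0.0 ],
    [0.0,  0.0,  w[0], w[1], w[2]],
])
assert np.allclose(A @ x, np.correlate(x, w, mode='valid'))

# Apply the transposed operator to a vector g living in the output space:
g = rng.standard_normal(3)
ATg = A.T @ g

# (A^T g)[n] = sum_m g[m] w[n-m]: a true convolution, i.e. a full
# cross-correlation with the FLIPPED filter w[::-1].
assert np.allclose(ATg, np.convolve(g, w, mode='full'))
assert np.allclose(ATg, np.correlate(np.pad(g, 2), w[::-1], mode='valid'))
print("transpose = correlation with flipped filter: OK")
```

Since PyTorch's `conv2d` is a cross-correlation as well, the same reasoning carries over to the 2-D case, with the flip applied along both spatial dimensions.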
If you want, you can instead use our general steepest-descent optimizer, which is implemented with double back-propagation and therefore applies the transpose automatically.
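As a sanity check, here is a minimal single-image, single-output-channel sketch (an illustrative analogue, not the exact code or tensor shapes used in the repository) showing that the filter gradient from autograd is reproduced by `F.conv_transpose2d` only after the data argument is flipped; the final slicing plays the role that the `padding=trans_pad` argument plays in the real implementation:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
C, H, W, k = 3, 8, 8, 3
x = torch.randn(1, C, H, W)                      # features
w = torch.randn(1, C, k, k, requires_grad=True)  # filter

y = F.conv2d(x, w)       # scores: [1, 1, H-k+1, W-k+1]
g = torch.randn_like(y)  # score-space residual / upstream gradient

# Reference: filter gradient from torch autograd
(grad_w_ref,) = torch.autograd.grad(y, w, g)

# Manual transpose: conv_transpose2d does NOT flip its data argument,
# so the residual must be flipped explicitly; the features act as kernel.
h, w_ = g.shape[-2:]
full = F.conv_transpose2d(g.flip((2, 3)), x)  # [1, C, h+H-1, w_+W-1]
grad_w_manual = full[:, :, h - 1:h - 1 + k, w_ - 1:w_ - 1 + k]

print(torch.allclose(grad_w_ref, grad_w_manual, atol=1e-5))
```

Dropping the `.flip((2, 3))` makes the two results disagree, which is exactly the flipping discussed above.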
In the supplementary material of DiMP, the transposed Jacobian corresponds to back-propagation through the convolutional layer with respect to its input, which is implemented in the following way:

```python
filter_grad = F.conv_transpose2d(
    input.flip((2, 3)).view(1, -1, input.shape[-2], input.shape[-1]),
    feat.view(-1, feat.shape[-3], feat.shape[-2], feat.shape[-1]),
    padding=trans_pad,
    groups=num_images * num_sequences)
```

What confuses me is why `input` needs to be flipped here. I was hoping you could help clear up my confusion.