Haiyang-W / DSVT

[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
https://arxiv.org/abs/2301.06051
Apache License 2.0
361 stars 28 forks source link

Why window sizes are set as multiples #1

Closed Angericky closed 1 year ago

Angericky commented 1 year ago

Hi, thanks for your amazing paper and solid experiments, which proposes a fast, easy-to-deploy, and remarkably performed transformer backbone.

I wonder about the hyrid window sizes. Why did you set 2nd window size N times of 1st window size? In this way, does the model skip inter-window voxel relation between even adjacent window pairs of the first partition?

chenshi3 commented 1 year ago

We utilize the window shift (half of the 1st window size) between the 1st window size and 2nd window size, which introduces connections between even adjacent window pairs of the first partition.