easylearningscores / PastNet

MM'2024

How does the VQ-VAE reduce the training cost of the proposed method? #1

Open bigfeetsmalltone opened 1 year ago

bigfeetsmalltone commented 1 year ago

Interesting work! However, in the DST module, the encoded feature maps with shape [T, C, \hat{H}, \hat{W}] are quantized into feature maps with shape [T, D, \hat{H}, \hat{W}]. This is confusing, since the spatial resolution of the two feature maps is the same, so there appears to be no reduction in computational cost. I hope you can clarify this issue.

easylearningscores commented 1 year ago

Apologies for my delayed response, and thank you for carefully reading our paper. Even though the spatial resolution of the feature maps does not change, reducing the channel dimensionality from C to D greatly decreases the computational overhead. Furthermore, we map the continuous input features onto a finite, discrete set of encodings, associating each spatial location with the nearest embedding vector from a codebook. Since the codebook size is fixed, each location can be represented by just the integer index of its codebook entry, i.e. with only a few bits. This significantly reduces both the amount of information that must be stored and the computational resources required to process it.
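To make the storage argument concrete, here is a minimal NumPy sketch of the nearest-codebook lookup described above. All sizes (T, C, D, codebook size K) and the channel projection are illustrative assumptions, not values from the paper:

```python
import numpy as np

# Hypothetical sizes (illustrative only): T frames, C encoder channels,
# a codebook of K embeddings of dimension D < C.
T, C, H, W = 4, 64, 16, 16
K, D = 512, 8

rng = np.random.default_rng(0)
feats = rng.normal(size=(T, C, H, W))        # encoder output [T, C, H, W]

# Assumed channel projection from C down to D before quantization.
proj = rng.normal(size=(C, D)) / np.sqrt(C)
z = np.einsum('tchw,cd->tdhw', feats, proj)  # [T, D, H, W]

codebook = rng.normal(size=(K, D))           # finite, discrete codebook

# Nearest-embedding lookup: each spatial vector is replaced by its
# closest codebook entry, so only the integer index must be stored.
flat = z.transpose(0, 2, 3, 1).reshape(-1, D)              # [T*H*W, D]
d2 = ((flat[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
idx = d2.argmin(axis=1)                                    # integer codes
zq = codebook[idx].reshape(T, H, W, D).transpose(0, 3, 1, 2)

# Storage comparison: 32-bit floats per channel vs. log2(K) bits per
# spatial location for the discrete codes.
dense_bits = T * C * H * W * 32
quant_bits = T * H * W * int(np.ceil(np.log2(K)))
print(dense_bits // quant_bits)  # compression factor
```

Note how the quantized map `zq` keeps the [T, D, \hat{H}, \hat{W}] shape, while the representation that actually needs to be stored is just `idx`, one small integer per spatial location.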