leyi-123 closed this issue 1 year ago
Hmm, it looks like your network's output is missing its temporal dimension. The output should be 4x200: a sequence of 4 patches, each a 200-d vector matching the number of elements in the codebook. Could you please check whether there is an indexing problem along the temporal dimension of the output?
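One way to run that check is to look at the shape of the predictor output just before the loss is computed. The snippet below is only a rough sketch and not code from this repository: the helper name `check_predictor_output` and its parameters are made up for illustration, and it assumes the sequence length (4) and codebook size (200) mentioned above. It takes a plain shape tuple, so with PyTorch you would pass something like `tuple(output.shape)`.

```python
def check_predictor_output(shape, seq_len=4, codebook_size=200):
    """Return None if the trailing dims are (seq_len, codebook_size),
    otherwise a short message describing the mismatch.

    Hypothetical helper: the defaults follow the 4x200 output described
    in this thread and are assumptions, not values read from the config.
    """
    shape = tuple(shape)

    # Expected case: (..., seq_len, codebook_size), with optional leading
    # batch dimensions.
    if len(shape) >= 2 and shape[-2:] == (seq_len, codebook_size):
        return None

    # The last dim matches the codebook, but it is not preceded by a
    # dimension of size seq_len, so the temporal axis was probably dropped
    # or indexed away.
    if shape and shape[-1] == codebook_size:
        return (
            f"shape {shape}: last dim matches the codebook size, but there is "
            f"no temporal dimension of size {seq_len} before it"
        )

    return f"shape {shape}: expected trailing dimensions ({seq_len}, {codebook_size})"


if __name__ == "__main__":
    for candidate in [(4, 200), (8, 4, 200), (200,), (4, 100)]:
        print(candidate, "->", check_predictor_output(candidate) or "ok")
```

If the helper reports a missing temporal dimension, a likely place to look is any indexing such as `output[:, -1]` or a `squeeze()` applied to the predictor output before the loss, since either would collapse the sequence axis.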
Hello, when I trained the predictor model on top of the provided VQ-VAE model, I got a runtime error:
python -u train_vq_decoder.py --config configs/vq/delta_v6.json
I want to know how to solve it. Thanks in advance!