liuqk3 / PUT

Paper 'Transformer based Pluralistic Image Completion with Reduced Information Loss' in TPAMI 2024 and 'Reduce Information Loss in Transformers for Pluralistic Image Inpainting' in CVPR2022
MIT License
173 stars 15 forks source link

About Dual-Codebook for Vector Quantization #33

Open tanbuzheng opened 7 months ago

tanbuzheng commented 7 months ago

Dear author, Thanks for sharing the code. I am greatly interested in your work. I have a question for you and would like your reply. In P-VQVAE, the masked and unmasked patches are encoded by linear layers independently, without considering any contexts. Does this influence the quality of the semantic codebooks, especially the codebook for the masked regions?