valeoai / Maskgit-pytorch


Clarification on Additional Token Usage and Embedding in Maskgit-pytorch Transformer #16

Closed: RohollahHS closed this issue 1 month ago

RohollahHS commented 2 months ago

Hi. Thanks for the great work. I have two questions.

  1. Can you please clarify what the second "+1" is used for? codebook_size is 1024, so the codebook indices lie in [0, 1023]. The first "+1" in the code is for the mask token, which gets index 1024. nclass is 1000 for ImageNet. I do not understand the purpose of increasing the nn.Embedding size by a further 1 (the layout I have in mind is sketched after these questions). Link to code

  2. In the linked code, why is self.codebook_size+1 used instead of self.codebook_size? What is the purpose of the additional token when we then compute the cross-entropy loss? Link to code
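For concreteness, here is a minimal sketch of the embedding layout the question refers to, assuming the values quoted above (codebook_size = 1024, nclass = 1000) and a hypothetical embed_dim; the actual variable names in the repository may differ.

```python
import torch.nn as nn

codebook_size = 1024  # VQ codebook indices: 0..1023
nclass = 1000         # ImageNet class labels, stored after the codebook + mask tokens
embed_dim = 768       # hypothetical hidden size

# Assumed index layout of the shared embedding table:
#   [0, 1023]     -> image (codebook) tokens
#   1024          -> mask token           (the first "+1")
#   [1025, 2024]  -> class tokens         (nclass entries)
#   2025          -> ???                  (the second "+1" the question is about)
tok_emb = nn.Embedding(codebook_size + 1 + nclass + 1, embed_dim)
```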

llvictorll commented 2 months ago

Hello,

  1. The second "+1" is used to mask the class token for classifier-free guidance: when the class condition is dropped, the label defaults to the last index of the embedding layer.

  2. This was primarily for an internal experiment I conducted. It allows the model to also predict the mask token, but it can be safely removed if predicting the mask token itself is not your objective. (A sketch illustrating both points is given below.)
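Here is a minimal sketch of both points, using hypothetical names (drop_prob, labels, logits) and the index layout from the question; it illustrates the mechanism described above rather than the repository's exact code.

```python
import torch
import torch.nn.functional as F

codebook_size, nclass = 1024, 1000
batch, seq_len = 8, 16 * 16
drop_prob = 0.1  # hypothetical probability of dropping the class condition

# Point 1: class labels are offset behind the codebook + mask tokens, and with
# probability drop_prob a label is replaced by the last embedding index
# (the second "+1"), which acts as the "null" class for classifier-free guidance.
labels = torch.randint(0, nclass, (batch,)) + codebook_size + 1
drop = torch.rand(batch) < drop_prob
labels[drop] = codebook_size + 1 + nclass  # last index of the embedding table

# Point 2: logits span codebook_size + 1 values so the model can also predict
# the mask token itself; since the ground-truth targets only contain codebook
# indices, the extra logit can be removed if that is not your objective.
logits = torch.randn(batch, seq_len, codebook_size + 1)      # dummy predictions
targets = torch.randint(0, codebook_size, (batch, seq_len))  # true codebook indices
loss = F.cross_entropy(logits.reshape(-1, codebook_size + 1), targets.reshape(-1))
```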

Best,

Victor