lucidrains / titok-pytorch

Implementation of TiTok, proposed by ByteDance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
MIT License

Quantized latents are not used at all? #1

Closed · inspirit closed this issue 2 months ago

inspirit commented 2 months ago

Hi Phil, are you sure we should be passing the original latents to the decoder instead of the quantized ones? https://github.com/lucidrains/titok-pytorch/blob/8e56054a9306edcab4643f5c6279ca4061057581/titok_pytorch/titok.py#L185

I looked through the paper, and they decode using the mask tokens + the quantized latents.
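
Roughly, here is what I'd expect the decode path to look like. Just a sketch with made-up names (`decode_from_latents`, `quantizer`, `decoder`, `mask_tokens`), not your actual titok.py code, and assuming a vector-quantize-pytorch style quantizer return:

```python
import torch

def decode_from_latents(latents, quantizer, decoder, mask_tokens):
    # quantize the 1D latent tokens first (the VQ bottleneck)
    # assumption: quantizer returns (quantized, indices, aux_loss) like vector-quantize-pytorch
    quantized, indices, aux_loss = quantizer(latents)

    # the decoder should only ever see the *quantized* latents,
    # concatenated with the learned mask tokens that stand in for the image patches
    batch = quantized.shape[0]
    masks = mask_tokens.expand(batch, -1, -1)  # assumption: mask_tokens is (1, num_patches, dim)
    decoder_input = torch.cat((masks, quantized), dim=1)

    return decoder(decoder_input)
```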

lucidrains commented 2 months ago

@inspirit hey! thanks as usual 😄