Open Delicious-Bitter-Melon opened 2 months ago
Hi, generation operates in the discrete token space produced by MAGVIT-v2.
Thanks for your reply. Does MAGVIT-v2 directly tokenize from pixel space?
Exactly.
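To make "tokenizing directly from pixel space" concrete, here is a toy sketch of the general idea: pixel patches are mapped to integer ids from a fixed codebook, and those ids are what a generator would model. This is not MAGVIT-v2 or Show-o code. A learned tokenizer uses a trained encoder and its own quantization scheme, whereas this example assumes a hand-written codebook and nearest-neighbour lookup. All names here (`split_into_patches`, `tokenize`, `detokenize`, `codebook`) are hypothetical.

```python
# Toy illustration only: a hand-written codebook with nearest-neighbour lookup.
# It is NOT the MAGVIT-v2 quantizer; every name below is hypothetical.

def split_into_patches(image, patch):
    """Cut a 2D grayscale image (list of rows) into flattened patch tuples,
    scanning left to right, top to bottom."""
    height, width = len(image), len(image[0])
    patches = []
    for r in range(0, height, patch):
        for c in range(0, width, patch):
            patches.append(
                tuple(image[r + dr][c + dc]
                      for dr in range(patch) for dc in range(patch))
            )
    return patches


def nearest_code(patch, codebook):
    """Return the index of the codebook entry closest to the patch
    (squared Euclidean distance)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: dist(patch, codebook[i]))


def tokenize(image, codebook, patch=2):
    """Pixels -> discrete token ids, one id per patch."""
    return [nearest_code(p, codebook) for p in split_into_patches(image, patch)]


def detokenize(tokens, codebook, width, patch=2):
    """Discrete token ids -> pixels, by pasting codebook patches back in place."""
    per_row = width // patch
    rows = []
    for start in range(0, len(tokens), per_row):
        row_tokens = tokens[start:start + per_row]
        for dr in range(patch):
            row = []
            for t in row_tokens:
                row.extend(codebook[t][dr * patch:(dr + 1) * patch])
            rows.append(row)
    return rows


# Three 2x2 "visual words": all black, all white, vertical stripe.
codebook = [(0, 0, 0, 0), (255, 255, 255, 255), (0, 255, 0, 255)]

image = [
    [0,   0, 250, 255],
    [5,   0, 255, 240],
    [0, 255,  10,  10],
    [0, 250,   0,   0],
]

tokens = tokenize(image, codebook)            # 4 patches -> 4 integer ids
restored = detokenize(tokens, codebook, width=4)
print(tokens)
print(restored)
```

The point of the sketch is only the data flow: the model never sees raw pixels or continuous latents, just the integer ids, and a decoder turns generated ids back into pixels.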
Thanks for your excellent work.
Does Show-o generate directly in pixel space, or does it generate in a latent space through a VAE?