Closed X-fxx closed 3 months ago
Thanks for your comment. Our region planning is only designed for providing proper initial position and size. Hence, in the denoising process, we enable model to adaptively modificate the size of objects for better generation quality.
Why does this person's avatar appear in the moon area? Which part of the method in the text does it correspond to?