Design Choices - Githubissues

bonlime commented 1 year ago

Hi! First of all thanks for releasing such a great model and accompanying paper. Could you clarify few design choices in the SDXL?

Why do you use both previous CLIP-L and new OpenCLIP ViT-bigG? Have you tried only using the later one, wouldn't it be enough?
The crop-conditioning while avoid generating too many cropped images, seems to generate more duplicated cases, where the object of interest is present everywhere, instead of being a single instance. See this comparisons. I wonder why not to use multi-aspect ( aka rectangles) training during all training process, rather than only during fine-tuning.

andreemic commented 1 year ago

Hey the link is gated, can you send the example here directly?

bonlime commented 1 year ago

@andreemic sorry, gave the wrong link. Here is the correct one.

And here are some representative examples of such behaviour

Stability-AI / generative-models