Failed to try with SDXL

beichenzbc / Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Apache License 2.0

620 stars 30 forks source link

Failed to try with SDXL #7

Closed ziyye closed 2 months ago

ziyye commented 6 months ago

SDXL have two clip text encoders: CLIP-L and CLIP-G. I replace the original CLIP-L text encoder with long-CLIP-L, then padding embeddings of original CLIP-G to length 248 (248 is the length of long-clip-L embeddings) and concat those embeddings with embeddings from long-CLIP-L. But the generated images not good. Anyone tried long-clip with SDXL, should it work?

GongXinyuu commented 6 months ago

SDXL have two clip text encoders: CLIP-L and CLIP-G. I replace the original CLIP-L text encoder with long-CLIP-L, then padding embeddings of original CLIP-G to length 248 (248 is the length of long-clip-L embeddings) and concat those embeddings with embeddings from long-CLIP-L. But the generated images not good. Anyone tried long-clip with SDXL, should it work?

Hi @ziyye would you mind sharing some images generated by your SDXL + long-CLIP-L implementation?

ziyye commented 6 months ago

Thanks for your reply! Here are some images generated: