You can actually observe some interesting behaviors when using text prompts. Sometimes the text correlates with regions that are unseen in your input image. But since we do not train on any text, this is an artifact of prior preservation, and it may or may not work in practice.
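For anyone who wants to experiment with this, here is a minimal sketch. It assumes the released custom pipeline (`sudo-ai/zero123plus-pipeline`) forwards a `prompt` argument to the text encoder; please verify against the `__call__` signature of the pipeline version you have installed, and the `input.png` path and prompt string below are just placeholders.

```python
import torch
from PIL import Image
from diffusers import DiffusionPipeline, EulerAncestralDiscreteScheduler

# Load the released Zero123++ weights with the custom pipeline.
pipeline = DiffusionPipeline.from_pretrained(
    "sudo-ai/zero123plus-v1.1",
    custom_pipeline="sudo-ai/zero123plus-pipeline",
    torch_dtype=torch.float16,
)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(
    pipeline.scheduler.config, timestep_spacing="trailing"
)
pipeline.to("cuda")

cond = Image.open("input.png")  # placeholder path to your conditioning image

# Default behavior: an empty prompt, i.e. the text condition T from the paper.
baseline = pipeline(cond, num_inference_steps=75).images[0]

# Experimental: a non-empty prompt (assumes the pipeline accepts `prompt=`).
# The model was never trained with text conditions, so any effect comes from
# the preserved Stable Diffusion prior and may or may not be useful.
with_text = pipeline(
    cond, prompt="a red metallic surface", num_inference_steps=75
).images[0]

baseline.save("baseline.png")
with_text.save("with_text.png")
```

Comparing the two outputs side by side is the easiest way to see whether the text has any effect on the unseen regions for a given input.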
In Section 2.4, "Global Condition: FlexDiffuse", the paper says:

> In the released Zero123++ models, we do not impose any text conditions, so T is obtained by encoding an empty prompt.
Does this imply that I cannot use a non-empty text prompt with the current release of Zero123++?