MountaintopLotus / braintrust

A Dockerized platform for running Stable Diffusion, on AWS (for now)
Apache License 2.0
1 stars 2 forks source link

Embeddings #44

Open JohnTigue opened 1 year ago

JohnTigue commented 1 year ago

Embeddings seem to be like finetuned models, but different. Some folks seems to be expressing preferences for embeddings over models. No idea what is going on.

JohnTigue commented 1 year ago

Looks like embeddings are just another name for models trained via textual inversion. Might just have a dup issue here, see #2.

JohnTigue commented 1 year ago

But embeddings are small files (say, ~10KB) which is much smaller than full models (gigabytes). They can be either style or object centric.

JohnTigue commented 1 year ago

Embeddings seem to be a SD2.1 thing, or at least they work together well. The embedding files are very small (tens of kilobytes) compared to fine tuned full models (~4GB). The have trigger words (often of the form foo-style) which can clearly demonstrate what they do. Powerful. Kinda like mix-ins.

JohnTigue commented 1 year ago

Seems both embeddings and checkpoints are still being trained by textual inversion AND Dreambooth, given that InvokeAI 2.2 is planning on support both.

Screen Shot 2023-01-15 at 12 03 01 PM
JohnTigue commented 1 year ago

In the finetuning space, it seems textual inversion has recently pulled up alongside DreamBooth which was supposedly gave better results in the SD 1.x timeframe.

Seems that textual inversion embedding took off with SD2.0. In earlier SD 1.x releases, the results were not as impressive as using DreamBooth. But once SD2.0, they were competitive with DreamBooth. Additionally, DreamBooth gives you a whole new multi-gigabyte model checkpoint file (.ckpt), versus embedding which are on the order of kilobytes [*].