lllyasviel / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)
Apache License 2.0
3.86k stars 331 forks source link

High-level instruction on how to use the LatentTransparencyOffsetEncoder model #96

Open kevsak5 opened 6 months ago

kevsak5 commented 6 months ago

Hi, Thanks again for this awesome tech. I see in several issues that Encoder support will come in the near future. Thank you for that. In the mean time, if I want to use LatentTransparencyOffsetEncoder and test a few things out, what's the expected input? From reading the decoder,

  1. seems like the input should be alpha then rgb. Is this correct?
  2. Are the input values [-1,1] or [0,1]
  3. From my simple testing of autoencoding, e.g. LatentTransparencyOffsetEncoder(alpha, RGB) + sdvae.encode( masked_rgb ) -> decode, it seems like not adding the offset performs better. Is this expected? Thanks again.
AbnerVictor commented 6 months ago

Me either, I am wondering how this latent offset Encoder should work.

fkcptlst commented 4 months ago

3. LatentTransparencyOffsetEncoder(alpha, RGB) + sdvae.encode( masked_rgb ) -> decode, it seems like not adding the offset performs better. Is this expected? Thanks again.

Hello, have you figured out yet?

layerdiffusion commented 4 months ago

Hey people, see also updates:

https://github.com/layerdiffusion/sd-forge-layerdiffuse/issues/90#issuecomment-2156095009