MC-E / DragonDiffusion

ICLR 2024 (Spotlight)
Apache License 2.0
695 stars 21 forks source link

Why does Figure 3 give different results in paper ? #4

Closed daidaiershidi closed 1 year ago

daidaiershidi commented 1 year ago

Thanks for the interesting work! According Figure 2, the same noise z_{T} is passed through different branches but with the same parameters. The features obtained (no matter which layer) should also be the same, and all losses should also be a fixed value, (cos_similarity is 1). But why do different layers of features give different results?

MC-E commented 1 year ago

The similarity here is not a fixed value, because the features involved in the similarity calculation come from different spatial locations.

daidaiershidi commented 1 year ago

The similarity here is not a fixed value, because the features involved in the similarity calculation come from different spatial locations.

I'm sorry I didn't understand. In Figure 3, you set m^{gen}, m^{gud} as zeros matrix and m^{share} as a ones matrix. In this way, why are the features involved in calculating similarity from different spatial locations?

MC-E commented 1 year ago

Hi, in the experiment shown in Figure 3, the input to the generation branch is random noise, while the input to the guidance branch is the inversion result. Therefore, the loss is not a fixed value.