Nota-NetsPresso / BK-SDM

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

How about KD training without EMA? #47

Closed by DL-Practise 8 months ago

DL-Practise commented 9 months ago

Thanks for your paper and code. My question is: how does the model perform when I do not use the EMA option, i.e., when I don't pass the option "--use_ema"?

bokyeong1015 commented 8 months ago

Hi,

In the test below with our code, we didn't observe notable differences between training with and without EMA.

Whether to use the EMA checkpoint or not seems quite debatable [discussion link].
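
For readers unfamiliar with the term, an EMA checkpoint stores an exponential moving average of the weights rather than the raw weights from the latest optimizer step. The snippet below is only a minimal sketch of that idea in plain Python. It is not the code in this repository: the function name `ema_update`, the toy scalar weight, and the decay value are all illustrative assumptions.

```python
# Minimal sketch of exponential moving average (EMA) weight tracking.
# All names and the decay value are illustrative assumptions,
# not taken from the BK-SDM training code.

def ema_update(ema_params, params, decay=0.999):
    """Return a new dict with ema <- decay * ema + (1 - decay) * param."""
    return {
        name: decay * ema_params[name] + (1.0 - decay) * value
        for name, value in params.items()
    }

# Toy "training" loop: a single scalar weight moves toward 1.0.
params = {"w": 0.0}
ema_params = dict(params)  # the EMA copy starts from the initial weights

for step in range(1000):
    params["w"] += 0.1 * (1.0 - params["w"])  # stand-in for an optimizer step
    ema_params = ema_update(ema_params, params, decay=0.99)

# params holds the raw (non-EMA) weights; ema_params holds the smoothed copy.
```

In this toy loop the EMA copy lags slightly behind the raw weight and converges to the same value, so the two copies end up close together once training has run long enough relative to the decay.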


Setup

Visual Results (50000-th iteration)

image

Quantitative Results