JiahuiYu / generative_inpainting

DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
http://jiahuiyu.com/deepfill/
Other
3.28k stars 789 forks source link

model size #45

Closed minmax100 closed 6 years ago

minmax100 commented 6 years ago

Hi Jiahui,

Nice work! I have a question about the model size. In your provided trained model, the size is pretty small (e.g., around 14 MB). However, the output after running your train.py is far larger (e.g., 126M). Just wonder did you conduct any post-processing to the model? or is there any parameters in your code to tune the model size? (any trade-off for such a compression?)

[Released model]

 69 May 22 16:15 checkpoint*

14M May 22 16:15 snap-0.data-00000-of-00001 3.6K May 22 16:15 snap-0.index 14M May 22 16:15 snap-0.meta*

[output of train.py]

77 May 23 15:50 checkpoint 84M May 23 20:01 events.out.tfevents.1527040937.dgx1-server2 126M May 23 15:50 snap-10000.data-00000-of-00001 12K May 23 15:50 snap-10000.index 14M May 23 15:50 snap-10000.meta

JiahuiYu commented 6 years ago

The released places2 model is only the generator model (celeba-hq has both generator and discriminator). The model size for your checkpoint is large is mainly because the weights for discriminators are included.