pre-train on ImageNet data?

fenglinglwb / EDT

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

130 stars 9 forks source link

Closed jiamingNo1 closed 1 year ago

jiamingNo1 commented 2 years ago

I wonder how to pretrain on ImageNet dataset?

for example, the Imagenet is always for object recognition, if our task is super-resolution, how do we construct the dataset?
the network is a hybrid (conv+transformer) architecture, when loading these pre-trained params, we load all the weights?

fenglinglwb commented 2 years ago

Thanks for your attention.

Sec.3.1 shows that "In terms of SR, we utilize bicubic interpolation to obtain low-resolution images. As for denoising and deraining, Gaussian noises (on RGB space) and rain streaks are directly added to the clean images."
Yes. Sec.3.1 shows that "The fine-tuning is performed on a single task, where the model is initialized with the pre-trained task-specific encoder and decoder as well as the shared transformer body."

jiamingNo1 commented 2 years ago

okay，thank you

jiamingNo1 commented 2 years ago

I have another question, how to choose 200k or 400k imagenet data, random or through some other way?

fenglinglwb commented 2 years ago

Randomly.

jiamingNo1 commented 2 years ago

ok！