isl-org / DPT

Dense Prediction Transformers
MIT License
1.96k stars 254 forks source link

auxiliary output #46

Closed Nickiris closed 1 year ago

Nickiris commented 3 years ago

"We employ a cross-entropy loss and add an auxiliary output head together with an auxiliary loss to the output of the penultimate fusion layer." According to your paper,the resolution of the auxiliary output is one-fourth of the input image. Do you upsample the auxiliary output to input?How do you design the auxiliary loss?

Nickiris commented 3 years ago

I wonder how to fine-tune DPT-Hybrid on the Pascal Context dataset,I couldn't get the results 60.46% on it.

ranftlr commented 3 years ago

Yes, the output of the auxiliary layer is upsampled to the shape original size using bilinear upsampling. The auxlayer is specified in the inference code. It is applied to "path_2" in the DPT base model. For fine-tuning:

Nickiris commented 2 years ago

Thanks for your reply!