daveredrum / ScanRefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
https://daveredrum.github.io/ScanRefer/
Other
236 stars 28 forks source link

Question about Language to object cls loss(lang_loss) #12

Closed cheng052 closed 3 years ago

cheng052 commented 3 years ago

Hi, zhenyu,

Thanks for your good work. However, I got a question when training the network.

lang-loss lang-acc

Firstly, I train a model from scratch with the command python script/train.py --use_multiview --use_normal, which enable the language to object classification loss(lang_loss).

The lang_loss and lang_acc are shown in the pictures.

It is very strange that the language loss on validation set is much lower than the loss on train set, which violates the basic principals of machine learning.

Also, the lang_acc is much higher on validation set than the acc on train set, which makes me more confused.

Can you check this problem? It can be of great help.

daveredrum commented 3 years ago

Hi @cheng052,

Thanks for your interest in our work! We apply a dropout (p=0.5) in the language classifier to prevent overfitting, as the model will easily overfit on the language classification subtask. That's why training loss stops to drop at some point and validation loss is way lower than the training. We've tested the model without setting the dropout, but it turned out to be a huge failure.

cheng052 commented 3 years ago

Hi @daveredrum,

Thanks for your prompt reply. It does solve my problem, thanks!