ilia10000 / dataset-distillation

Soft-Label Dataset Distillation and Text Dataset Distillation
MIT License
73 stars · 6 forks

Questions on loss calculation #4

Open · data-science-lover opened this issue 2 years ago

data-science-lover commented 2 years ago

I have several questions about this. First, I noticed that the loss calculation for a two-class model is handled differently. Since my model has two classes, errors appeared (including memory-limit errors). So I removed that condition and computed the loss only with cross-entropy, F.cross_entropy(output, target), since a binary classification can be treated as a multi-class classification:

[screenshot: the modified loss-calculation code]

I made the same change in every other place where the loss is calculated.
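As a sanity check on the substitution described above, here is a minimal sketch (not the repository's code; tensor names are illustrative) showing that multi-class cross-entropy with two logits is numerically equivalent to binary cross-entropy on the logit difference, so the two-class special case can indeed be dropped:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
output = torch.randn(8, 2)            # logits for 2 classes
target = torch.randint(0, 2, (8,))    # integer class labels in {0, 1}

# Multi-class cross-entropy works for any number of classes, including 2.
loss_ce = F.cross_entropy(output, target)

# Equivalent binary formulation: BCE-with-logits on the logit difference,
# since softmax over two logits reduces to sigmoid(z1 - z0).
loss_bce = F.binary_cross_entropy_with_logits(
    output[:, 1] - output[:, 0], target.float())

print(torch.allclose(loss_ce, loss_bce, atol=1e-6))  # prints True
```

This is why the removal is safe mathematically; any memory-limit errors you saw would come from how the special-cased branch was implemented, not from the loss definition itself.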

I also have doubts about the loss values calculated in the train() function of train_distilled_images: the loss increases at each step instead of decreasing.

[screenshots: training-loss values rising across steps]

data-science-lover commented 2 years ago

I don't understand, because the loss calculated in the test_runner of the main script gives VERY different values from those obtained in train_distilled_images.

[screenshot: test_runner loss values]

So, to compare the loss obtained with the distilled images against the loss obtained with the real dataset, should we rely on the values from the main.py script or from train_distilled_images?
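One hypothetical reason (an assumption on my part, not confirmed from the repository) that two scripts report very different loss magnitudes is that they evaluate on different data or use different reductions. A small illustration of the reduction effect alone:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
output = torch.randn(64, 2)           # logits for a batch of 64 examples
target = torch.randint(0, 2, (64,))

# 'mean' averages per example; 'sum' accumulates over the whole batch.
loss_mean = F.cross_entropy(output, target, reduction='mean')
loss_sum = F.cross_entropy(output, target, reduction='sum')

print(float(loss_sum / loss_mean))  # ratio equals the batch size, 64
```

For a fair comparison, make sure both numbers are computed with the same model state, the same evaluation data, and the same reduction before concluding anything from their difference.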

Thank you in advance.