deep-learning-with-pytorch / dlwpt-code

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.
https://www.manning.com/books/deep-learning-with-pytorch

p2ch11: label_g[:,1] as target label in computeBatchLoss() ? #100

Closed: 4179e1 closed this issue 1 year ago

4179e1 commented 1 year ago

Hi,

I'm having trouble understanding the label_g[:,1] used in computeBatchLoss():

https://github.com/deep-learning-with-pytorch/dlwpt-code/blob/d6c0210143daa133bbdeddaffc8993b1e17b5174/p2ch11/training.py#L225-L238

Assume the batch size is 32; logits_g will then have shape [32, 2]. label_g has the same shape [32, 2], and if I'm not mistaken it's the one-hot vector defined in https://github.com/deep-learning-with-pytorch/dlwpt-code/blob/d6c0210143daa133bbdeddaffc8993b1e17b5174/p2ch11/dsets.py#L203-L210
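
For context, here is a minimal illustrative sketch of how such a one-hot label could be built, assuming a per-sample isNodule_bool flag as in the linked dsets.py (this is not the verbatim repo code):

import torch

# Hypothetical sketch: index 0 means "not a nodule", index 1 means "is a nodule".
isNodule_bool = True  # assumed per-sample flag from the candidate info
pos_t = torch.tensor(
    [not isNodule_bool, isNodule_bool],
    dtype=torch.long,
)
print(pos_t)  # tensor([0, 1]) for a positive sample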

My question is: in the CrossEntropyLoss function, should we use label_g instead of label_g[:,1] (which takes the 2nd column of each item)? Something like:

        loss_g = loss_func(
            logits_g,
            label_g, # the one-hot vector instead of label_g[:,1]
        )

or

        loss_g = loss_func(
            logits_g,
            torch.argmax(label_g, dim=1), # if we want to use the index 
        )
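
For reference, a quick hedged demo of the target forms nn.CrossEntropyLoss accepts: class indices of shape [N], and, since PyTorch 1.10, float class probabilities of shape [N, C]. The tensors below are made up for illustration:

import torch
import torch.nn as nn

loss_func = nn.CrossEntropyLoss()
logits_g = torch.randn(4, 2)  # made-up logits, illustrative only
label_g = torch.tensor([[1, 0], [0, 1], [1, 0], [0, 1]])  # long one-hot

# Class-index target of shape [N]: what torch.argmax(label_g, dim=1) produces.
loss_idx = loss_func(logits_g, torch.argmax(label_g, dim=1))

# Since PyTorch 1.10, float class-probability targets of shape [N, C] also work;
# the long one-hot has to be cast to float first.
loss_prob = loss_func(logits_g, label_g.float())

print(torch.isclose(loss_idx, loss_prob))  # tensor(True): same loss for exact one-hots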

Thanks

4179e1 commented 1 year ago

hmm, just realized that it's a shortcut that only applies to binary classification: label_g[:,1] and torch.argmax(label_g, dim=1) produce the same result:

import torch

t = torch.tensor([
    [1, 0],
    [0, 1],
    [1, 0],
])

print(torch.argmax(t, dim=1))  # tensor([0, 1, 0])
print(t[:, 1])                 # tensor([0, 1, 0])

However, it doesn't hold for multi-class classification:

t = torch.tensor([
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
])

print(torch.argmax(t, dim=1))  # tensor([0, 1, 2])
print(t[:, 1])                 # tensor([0, 1, 0])
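
So for the binary case in computeBatchLoss(), the two target forms are interchangeable. A quick sanity check (made-up logits, just for illustration):

import torch
import torch.nn as nn

loss_func = nn.CrossEntropyLoss()
logits_g = torch.randn(3, 2)  # made-up logits
label_g = torch.tensor([[1, 0], [0, 1], [1, 0]])

# Both expressions yield the same class-index target, hence the same loss.
loss_a = loss_func(logits_g, label_g[:, 1])
loss_b = loss_func(logits_g, torch.argmax(label_g, dim=1))
print(torch.equal(loss_a, loss_b))  # True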