Can you please explain your implementation of loss for the attribute classification branch. I'm no able to comprehend the 2*1000 output layer and the use of cross entropy on that. That data is multi-label classification if I'm not wrong, how do you modify the cross entropy loss to use on multi-label data, it should've been Sigmoid+BCELoss if I'm following the online PyTorch tutorials correctly.
Hi authors,
Can you please explain your implementation of loss for the attribute classification branch. I'm no able to comprehend the 2*1000 output layer and the use of cross entropy on that. That data is multi-label classification if I'm not wrong, how do you modify the cross entropy loss to use on multi-label data, it should've been Sigmoid+BCELoss if I'm following the online PyTorch tutorials correctly.
Thanks