Closed ghost closed 3 years ago
Hi,
Thanks for the share.
Based on your equation 3,
The tilde y in here is the pre-generated hard pseudo label, and y hat is the model prediction. However, according to your code in here,
y_nl = torch.ones((nl_logits.shape)).to(device=args.device, dtype=logits.dtype) loss_nl += torch.mean((-torch.sum((y_nl * torch.log(pred_nl))*nl_mask, dim = -1))/(torch.sum(nl_mask, dim = -1) + 1e-7))
it looks like you only multiply the log(1- y hat) with the 1, instead of the (1-y tilde).
Do I miss something?
Cheers,
Hi,
Thanks for the share.
Based on your equation 3,
The tilde y in here is the pre-generated hard pseudo label, and y hat is the model prediction. However, according to your code in here,
it looks like you only multiply the log(1- y hat) with the 1, instead of the (1-y tilde).
Do I miss something?
Cheers,