lenscloth / RKD

Official PyTorch implementation of Relational Knowledge Distillation, CVPR 2019

In the loss design, is the `torch.no_grad` essential? #16

Closed leoluopy closed 4 years ago

leoluopy commented 4 years ago

Hi, glad to see you. I am reading your loss design now and found the code below:

```python
class RKdAngle(nn.Module):
    def forward(self, student, teacher):
        # student, teacher: N x C
        with torch.no_grad():
            # pairwise differences: N x N x C
            td = (teacher.unsqueeze(0) - teacher.unsqueeze(1))
            norm_td = F.normalize(td, p=2, dim=2)
            t_angle = torch.bmm(norm_td, norm_td.transpose(1, 2)).view(-1)

        sd = (student.unsqueeze(0) - student.unsqueeze(1))
        norm_sd = F.normalize(sd, p=2, dim=2)
        s_angle = torch.bmm(norm_sd, norm_sd.transpose(1, 2)).view(-1)

        loss = F.smooth_l1_loss(s_angle, t_angle, reduction='elementwise_mean')
        return loss
```

In both the RKD angle loss and the RKD distance loss, the teacher-related code is wrapped in `torch.no_grad`. Is that essential? Can it be removed?

lenscloth commented 4 years ago

@leoluopy Hello,

"torch.no_grad" is not necessary. Since the teacher model is already forwarded in the context of "torch.no_grad"

```python
with torch.no_grad():
    t_b1, t_b2, t_b3, t_b4, t_pool, t_e = teacher(teacher_normalize(images), True)
```
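To make the reasoning concrete, here is a minimal standalone sketch (not code from this repo) showing why the inner `torch.no_grad` is redundant: a tensor produced inside a `torch.no_grad` context has `requires_grad=False`, so later operations on it never build an autograd graph, with or without a second `no_grad` wrapper.

```python
import torch

# A stand-in for model inputs that would normally require gradients.
x = torch.randn(4, 8, requires_grad=True)

# Stand-in for the teacher forward pass, done under no_grad
# (as in the training loop quoted above).
with torch.no_grad():
    teacher_emb = x * 2.0

# The teacher embeddings are detached from the graph:
print(teacher_emb.requires_grad)  # False

# Ops on them outside any no_grad context still track no gradients,
# so wrapping the pairwise-difference computation in no_grad again
# changes nothing about gradient flow:
td = teacher_emb.unsqueeze(0) - teacher_emb.unsqueeze(1)
print(td.requires_grad)  # False

# By contrast, the same op on a graph-connected tensor does track grads:
y = x * 2.0
print(y.requires_grad)  # True
```

The inner `no_grad` can still be a harmless safeguard (and saves memory if the loss were ever called with a teacher forwarded outside `no_grad`), but it is not required for correctness here.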