Hi, thank you for your wonderful work! I noticed that during training, the loss function is composed of the attention loss and cam_up_similarity. Since cam_up_similarity is not discussed in the paper, could you explain what role it plays?