Closed blurLake closed 2 years ago
Hi, I am trying to figure out how the loss is calculated, for example here. I assume it is some distance between generated_ids and target_ids with attention_masks, but could you point it out how the code and formula look like? Thank you very much!
HI, as CodeT5 is adapted from Huggingface's T5 implementation, you should refer to official documentations like here.
Hi, I am trying to figure out how the loss is calculated, for example here. I assume it is some distance between generated_ids and target_ids with attention_masks, but could you point it out how the code and formula look like? Thank you very much!