Open lkfo415579 opened 3 years ago
Could you please elaborate? Loss on negative labels shouldn't be a problem, right?
Oh, I see what you mean. All graphs are placed along the diagonal of the batched graph so that the individual graphs are independent and disjoint. I tried masking the loss matrix by setting the loss on the negative (-ve) edges to 0. However, my loss goes out of bounds and ends up at a huge negative value.
Because the final adjacency matrix (the label) will be one big graph that spans all samples in the batch. If you simply compute the BCE loss over the whole matrix, there will be a lot of extra negative labels from node pairs that belong to different graphs.
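
To make the point about extra negatives concrete, here is a minimal NumPy sketch, not code from this repo. It assumes a per-node `batch` vector that gives the graph index of each node; the helper names `block_diagonal_mask` and `masked_bce` are made up for illustration. The idea is to keep only node pairs inside the same graph and average the BCE over those entries.

```python
import numpy as np

def block_diagonal_mask(batch):
    """batch[i] is the graph index of node i (assumed layout).
    Returns a boolean matrix that is True where both nodes share a graph."""
    batch = np.asarray(batch)
    return batch[:, None] == batch[None, :]

def masked_bce(probs, labels, mask, eps=1e-7):
    """Mean binary cross-entropy over the entries where mask is True."""
    probs = np.clip(probs, eps, 1.0 - eps)  # avoid log(0)
    loss = -(labels * np.log(probs) + (1.0 - labels) * np.log(1.0 - probs))
    return float((loss * mask).sum() / mask.sum())

# Two graphs with 2 and 3 nodes, batched into one 5-node graph.
batch = [0, 0, 1, 1, 1]
mask = block_diagonal_mask(batch)

labels = np.zeros((5, 5))
labels[0, 1] = labels[1, 0] = 1.0  # one edge inside graph 0
labels[2, 3] = labels[3, 2] = 1.0  # one edge inside graph 1

probs = np.full((5, 5), 0.5)  # dummy predictions, for illustration only

n_within = int(mask.sum())         # pairs inside a single graph
n_cross = labels.size - n_within   # cross-graph pairs, all labelled 0

print(n_within, n_cross)
print(masked_bce(probs, labels, mask))
```

In this toy batch, 12 of the 25 entries are cross-graph pairs that a plain BCE would count as negatives. The mask here still includes the diagonal (self-pairs), which you may want to drop as well. Also, BCE written this way is non-negative whenever the labels are in [0, 1] and the predictions are probabilities, so a large negative loss suggests one of those two conditions does not hold.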