Open eiriksteen opened 1 month ago
It seems the error is related to the self.ema() function and the dimensions in q/k and self.ema_matrix doesn't match. May I have the detailed dimensions of q/k and self.ema_matrix in your experiment?
Thank you for the quick response! These are the shapes:
q: torch.Size([32, 7, 8, 21, 32]), ema: torch.Size([8, 8])
I am trying to reproduce the results from your paper, but the code doesn't run. Specifically the last line here throws an error which isn't very easy to debug for someone not familiar with the code:
This is the error:
Any idea what the problem is?