I see that in multiple locations in the code where parameters are initialized you are using torch.empty,
which looks like this: nn.Parameter(torch.empty( ...
I assume that this is to speedup model loading, since when used purely for inference it will be overridden by the checkpoint params anyway.
However, I'm trying to reproduce your training, how do you initialize the following within modules.py ?
I see that in multiple locations in the code where parameters are initialized you are using torch.empty, which looks like this:
nn.Parameter(torch.empty( ...
I assume that this is to speedup model loading, since when used purely for inference it will be overridden by the checkpoint params anyway.
However, I'm trying to reproduce your training, how do you initialize the following within modules.py ?
Class Node2Edge
Class Attention: 1.self.qg_weights - 2 self.kv_weights - 3 self.qg_bias - 4 self.kv_bias -
class GeometricAttention:
Thanks!