Because the projection matrix is created on the fly during the first forward pass, it isn't present in a brand new model. Thus, when loading from a state dictionary with parameters, the loaded state dictionary will be rejected until the model goes through at least one forward pass.
Because the projection matrix is created on the fly during the first forward pass, it isn't present in a brand new model. Thus, when loading from a state dictionary with parameters, the loaded state dictionary will be rejected until the model goes through at least one forward pass.