Closed JasonLeeFdu closed 2 months ago
It says that there are 2 params which are not found in the pretrained weights. It is very unsettling since I copyed the code from huggingface-1.8B model card- quick start.
Still
The post_layernorm
is applied after the output of the ViT last layer, which is not used in our model and we delete them during training. So you can ignore this log, it does not affect the performance.