Closed henry-tujia closed 8 months ago
Looks like this should work if you add the name to the lm_head_namings
attribute of the AutoModelForSeq2SeqLMWithValueHead
.
I'll give it a try and share the outcome. Thanks for the response!
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
I observed that this function checks for the existence of the
lm_head
using keywords. However, when dealing with an Encoder-Decoder model, it fails to detect the linear layer (BertOnlyMLMHead), which serves as the final layer.Error Message
Bert2Bert Model Code
Bert2Bert Model Structure