Closed anirbandey303 closed 6 months ago
Hi @anirbandey303,
Yes, the pre-trained language model (such as BERT) weights do change.
The numerical/categorical features are not used by the language (BERT) model. They are sent as features to the MLP alongside the output of the language model. The architecture diagram here might help!
Perfect, that clears out my confusion. Thanks a lot for the prompt response. 👍
Happy to help :)
Hi Developers,
I have a naive question, could you please help me understand: