How to train models with contrastive losses, masks, etc.

salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

https://arxiv.org/abs/2305.07922

BSD 3-Clause "New" or "Revised" License

2.65k stars 391 forks source link

How to train models with contrastive losses, masks, etc. #158

Open 18liumin opened 7 months ago

n1o commented 1 month ago

I would like to see the actual pretraining of CodeT5, I am reading up on the Identifier Tagging objective, where you transform the contextual representation of the encoder into a vector of probabilities. Unfortunately there no discussion on how, I assume there is an ProjectionLayer with an L2 norm that gets fed into a sigmoid. I came to look up the code, but I either to struggle to find it, or I am just blind.