How to initialize encoder with CodeGen-mono 350M?

salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

https://arxiv.org/abs/2305.07922

BSD 3-Clause "New" or "Revised" License


How to initialize encoder with CodeGen-mono 350M? #162

Open skye95git opened 8 months ago

skye95git commented 8 months ago

According to the paper "CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis", CodeGen follows a standard transformer decoder architecture with left-to-right causal masking. How do you use CodeGen-mono 350M, which is a decoder-only model, to initialize the encoder? As far as I know, an encoder and a decoder differ slightly: an encoder attends bidirectionally over the whole input, whereas a decoder applies a causal mask so that each position only sees earlier positions.
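To make the difference I am asking about concrete, here is a minimal sketch of single-head self-attention in plain Python. It is only an illustration, not how CodeT5+ or CodeGen is implemented: the function `self_attention`, the helper `rand_matrix`, and the toy sizes are all hypothetical. It shows that the same projection weights can be run either with a causal mask (decoder style) or without one (encoder style), so the parameter shapes match and only the masking differs.

```python
import math
import random


def self_attention(x, wq, wk, wv, causal):
    """Single-head self-attention over a list of token vectors.

    The weights wq/wk/wv are identical in both modes; only the mask changes.
    """
    def matvec(w, vec):
        return [sum(wij * vj for wij, vj in zip(row, vec)) for row in w]

    q = [matvec(wq, t) for t in x]
    k = [matvec(wk, t) for t in x]
    v = [matvec(wv, t) for t in x]
    scale = math.sqrt(len(q[0]))

    out = []
    for i in range(len(x)):
        # Causal (decoder): position i sees positions 0..i only.
        # Bidirectional (encoder): position i sees every position.
        visible = list(range(i + 1)) if causal else list(range(len(x)))
        scores = [sum(a * b for a, b in zip(q[i], k[j])) / scale for j in visible]
        top = max(scores)  # subtract the max for a numerically stable softmax
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        out.append([
            sum(w * v[j][d] for w, j in zip(weights, visible))
            for d in range(len(v[0]))
        ])
    return out


random.seed(0)
dim, seq_len = 4, 3  # toy sizes, chosen only for illustration


def rand_matrix():
    return [[random.gauss(0, 1) for _ in range(dim)] for _ in range(dim)]


wq, wk, wv = rand_matrix(), rand_matrix(), rand_matrix()
x = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(seq_len)]

decoder_out = self_attention(x, wq, wk, wv, causal=True)   # decoder-style
encoder_out = self_attention(x, wq, wk, wv, causal=False)  # encoder-style
```

In this sketch the last position gives the same output in both modes, because it sees every token either way, while earlier positions differ. What I would like to know is whether the encoder initialization simply reuses the decoder weights like this with the causal mask removed, or whether something else is done.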