salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.74k stars, 401 forks

How to use encoder from CodeT5+ #128

Open dongchirua opened 1 year ago

dongchirua commented 1 year ago

Hi,

How can I use the encoder part of CodeT5+ on its own to extract embeddings?

yuewang-cuhk commented 1 year ago

Hi there, we have released the embedding model of CodeT5+ 110M here, which achieves very competitive performance on multiple text-to-code retrieval tasks. The CodeT5+ 110M embedding model can extract 256-dimensional embeddings for both code and text.
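For readers who want a starting point, here is a minimal sketch of extracting an embedding with Hugging Face `transformers`. It assumes the checkpoint is published on the Hugging Face Hub as `Salesforce/codet5p-110m-embedding` (the link target in the reply above is not preserved here, so treat the name as an assumption) and that it loads through `AutoModel` with `trust_remote_code=True`. The helper names `load_embedder`, `embed`, and `demo` are hypothetical.

```python
def load_embedder(checkpoint="Salesforce/codet5p-110m-embedding"):
    """Load tokenizer and model. The checkpoint name is an assumption."""
    # Imported lazily so embed() can be used with any compatible objects.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint, trust_remote_code=True)
    model = AutoModel.from_pretrained(checkpoint, trust_remote_code=True)
    return tokenizer, model


def embed(tokenizer, model, text):
    """Return one embedding vector for a single code or text string."""
    input_ids = tokenizer.encode(text, return_tensors="pt")
    # The model is called on a batch of size one; take the first row.
    return model(input_ids)[0]


def demo():
    """Embed a code snippet and a query, then compare them (downloads weights)."""
    tokenizer, model = load_embedder()
    code_vec = embed(tokenizer, model, "def add(a, b):\n    return a + b")
    text_vec = embed(tokenizer, model, "add two numbers")
    print(code_vec.shape)
    # Dot product as a similarity score between the two vectors.
    print(float(code_vec @ text_vec))
```

Calling `demo()` downloads the weights on first use. For retrieval, one would typically embed all candidate snippets once and rank them by this score against the query embedding.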

aniucw commented 4 months ago

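Hi there, we have released the embedding model of CodeT5+ 110M here, which achieves very competitive performance on multiple text-to-code retrieval tasks. The CodeT5+ 110M embedding model can extract 256-dimensional embeddings for both code and text.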

Hello, I want to implement an encoder-decoder program translation task with CodeT5, but I need to manipulate the embeddings generated by the encoder. Is there a way to do this?
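One possible approach, sketched under assumptions rather than confirmed by the maintainers: run the encoder separately, edit its hidden states, and hand the result to the decoder through the `encoder_outputs` argument that Hugging Face encoder-decoder models accept. The sketch assumes a checkpoint such as `Salesforce/codet5p-220m` that loads as `T5ForConditionalGeneration`. The helper names and `edit_fn` are hypothetical, and the behavior of `generate` without `input_ids` may vary across `transformers` versions.

```python
def load_seq2seq(checkpoint="Salesforce/codet5p-220m"):
    """Load an encoder-decoder checkpoint. The checkpoint name is an assumption."""
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = T5ForConditionalGeneration.from_pretrained(checkpoint)
    return tokenizer, model


def generate_with_edited_encoding(tokenizer, model, source, edit_fn, max_new_tokens=64):
    """Encode `source`, apply `edit_fn` to the encoder states, then decode."""
    inputs = tokenizer(source, return_tensors="pt")

    # Step 1: run only the encoder to get per-token hidden states.
    enc = model.encoder(
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
    )

    # Step 2: manipulate the hidden states (shape: batch x tokens x hidden).
    enc.last_hidden_state = edit_fn(enc.last_hidden_state)

    # Step 3: decode from the edited states instead of re-encoding the input.
    output_ids = model.generate(
        encoder_outputs=enc,
        attention_mask=inputs["attention_mask"],
        max_new_tokens=max_new_tokens,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For example, `generate_with_edited_encoding(tokenizer, model, src, lambda h: h * 0.9)` scales the encoder states before decoding. For training rather than generation, the model's forward pass also takes `encoder_outputs` together with `labels`, so the same edit can be applied inside a fine-tuning loop.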