salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.68k stars 394 forks source link

Task Control Codes #70

Closed mohamed-abdelaziz721 closed 1 year ago

mohamed-abdelaziz721 commented 1 year ago

I was using hugging faces codet5-base to try code generation and understanding tasks.

I cannot find documentation anywhere that indicates how to use "task control codes" for different input types.

I am trying to do something like Figure 1 in the original paper

yuewang-cuhk commented 1 year ago

Hi there, the codet5-base checkpoint is the one trained with the proposed identifier-aware denoising pretraining objective, while "task control codes" are only used for the multi-task finetuning. So you need to finetune CodeT5-base checkpoint on downstream tasks or directly use our released checkpoints.