salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.68k stars 392 forks source link

unable to regenerate output #39

Closed Pked01 closed 2 years ago

Pked01 commented 2 years ago

Hi Thanks for sharing such useful models I am trying regenerate the output as shared by you in the gif as per gif the input is //convert one currency to another for generating the output I use code-t5-base model from hugging face and replaced the binary file with concode_codet5_base(from finetune models) But if you check output below its lot different from what is there in the gif image

My question are:

  1. Is my approach correct to run the model?
  2. If what should be the recommended setting to get the correct output for text to code generation
yuewang-cuhk commented 2 years ago

Hi, the model behind the GIF is a CodeT5 checkpoint fine-tuned on our internal Apex data for a text-to-code generation task. Currently the released text-to-code checkpoint is specifically for the Concode dataset and there is no released models for generic text-to-code generation. You are recommended to fine-tune such model on your domain datasets using the CodeT5 pretrained models.