salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.68k stars 394 forks source link

Generation task (SysML) #68

Closed mirzayaser3 closed 1 year ago

mirzayaser3 commented 1 year ago

I want to create my own dataset for the generation task. I want to convert text to SySML code. The SysML code examples are here (https://github.com/Systems-Modeling/SysML-v2-Release/tree/master/sysml/src/examples).

In the data/concode folder I want to give my own dev, test, and train.json files. But the thing is will I be able to generate SysML code? Is the current codeT5 compatible with my task? I am starting to research this code. Any suggestions or ideas are appreciable. Thanks.

yuewang-cuhk commented 1 year ago

Hi there, please follow here to finetune CodeT5 on the customized dataset. You would be suggested to create a new task for it.