salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.66k stars 391 forks source link

Do you plan to open source your data processing scripts or pre-training data sets #110

Open skye95git opened 1 year ago

skye95git commented 1 year ago

Hi, promising work! I want to reproduce your work. Do you plan to open source your data processing scripts or pre-training data sets?