salesforce / CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation
https://arxiv.org/abs/2305.07922
BSD 3-Clause "New" or "Revised" License
2.71k stars 396 forks source link

Missing 'Train.jsonl' in Python Summarization #8

Closed RecklessRonan closed 2 years ago

RecklessRonan commented 2 years ago

Hi, thanks for your excellent work.

I am fine-tuning code summarization task and I found there&prefix=&forceOnObjectsSortingFiltering=false) misses 'train.jsonl' in summarization of python language.

I note that there is a little difference between your dataset and data in CodeXGLUE.

Could you please upload your 'train.jsonl'?

yuewang-cuhk commented 2 years ago

Hi, thanks for spotting this. We have uploaded the train.jsonl for Python.