microsoft / CodeBERT

CodeBERT
MIT License
2.09k stars 430 forks source link

Where do you get the source code to include in jsonl? #319

Open Daocuong-main opened 2 months ago

Daocuong-main commented 2 months ago

I'm having to build a language model to classify clones. I want to do a little editing in the data so I need to get it from the original BigCloneBench data. However, when I download the data from BigCloneBench's Github, it only has the H2 database and when I open it, there are no tables containing the source code. So I want to ask, where did you get the source code to create the jsonl file?