microsoft / CodeBERT

CodeBERT
MIT License
2.15k stars 442 forks source link

pre-training dataset for CodeReviewer #226

Open oathaha opened 1 year ago

oathaha commented 1 year ago

Hi. Where can I find pre-training dataset (and its metadata of projects and pull requests in GitHub) for CodeReviewer?

celbree commented 1 year ago

We don't release the pre-training dataset for CodeReviewer. But you can use this repo to get project list and use this repo for crawling.