google-research-datasets / great

The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/forum?id=B1lnbRNtwr]

dataset #2

Open Gracccce opened 2 years ago

Gracccce commented 2 years ago

I'd like to make a new dataset, and I am wondering how to get the graph edge types. Could you please give me more details on how to process your dataset?

VHellendoorn commented 2 years ago

Hi, the repository we used to parse Python code and generate program graphs has been open-sourced here. It won't output samples in exactly the same format as this dataset, but it contains all the building blocks needed to get the same information.

Hope that helps! -Vincent
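
For readers who want a feel for what "typed graph edges" over Python code can look like, here is a minimal sketch using only the standard-library `ast` module. It is not the open-sourced tool mentioned above and does not reproduce this dataset's format. The function name `build_edges` and the edge type names `CHILD` and `NEXT_USE` are hypothetical, chosen for illustration only.

```python
import ast
from collections import defaultdict

# Helper node classes (Load/Store, Add, Not, ...) carry no source position
# and are shared between parents, so this sketch leaves them out of the graph.
_SKIPPED = (ast.expr_context, ast.operator, ast.unaryop, ast.boolop, ast.cmpop)


def build_edges(source):
    """Parse `source` and return (nodes, edges).

    `edges` is a list of (edge_type, src_index, dst_index) tuples, where the
    indices point into `nodes`. Edge type names are illustrative assumptions.
    """
    tree = ast.parse(source)
    nodes = [n for n in ast.walk(tree) if not isinstance(n, _SKIPPED)]
    index = {id(n): i for i, n in enumerate(nodes)}
    edges = []

    # CHILD: syntactic parent -> child edges of the AST.
    for parent in nodes:
        for child in ast.iter_child_nodes(parent):
            if id(child) in index:
                edges.append(("CHILD", index[id(parent)], index[id(child)]))

    # NEXT_USE: link each occurrence of a variable name to the next
    # occurrence of the same name in source order (ignores scoping).
    occurrences = defaultdict(list)
    for n in nodes:
        if isinstance(n, ast.Name):
            occurrences[n.id].append(n)
    for uses in occurrences.values():
        uses.sort(key=lambda n: (n.lineno, n.col_offset))
        for a, b in zip(uses, uses[1:]):
            edges.append(("NEXT_USE", index[id(a)], index[id(b)]))

    return nodes, edges


if __name__ == "__main__":
    nodes, edges = build_edges("x = 1\ny = x + 2\n")
    for edge_type, src, dst in edges:
        print(edge_type, type(nodes[src]).__name__, "->", type(nodes[dst]).__name__)
```

A real pipeline would add further edge types (for example control-flow or data-flow edges) and respect variable scopes; this sketch only shows the general shape of turning a parsed program into a list of typed edges.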