Since method names, variable names, and method signatures all carry semantic signals that are important for code representation and retrieval, is there a version of the Code Search (AdvTest) dataset in which these programmer-defined names are not anonymized?
You can download the raw dataset with wget https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/python.zip and look up the original, non-anonymized source code by URL.
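In case it helps, here is a minimal sketch of that URL lookup. It rests on a few assumptions that are not stated in this thread: that the archive unpacks into gzipped JSON Lines files under python/final/jsonl/, that every record in both the raw data and AdvTest carries a "url" field, and that the raw record stores the original function body under "code". The helper names (build_url_index, load_raw_python_index, restore_original) are made up for illustration.

```python
import gzip
import json
from pathlib import Path


def build_url_index(jsonl_gz_paths):
    """Map each record's "url" to the full raw record.

    Assumes every line is a JSON object with a "url" key (an assumption
    about the raw CodeSearchNet format, not confirmed in this thread).
    """
    index = {}
    for path in jsonl_gz_paths:
        with gzip.open(path, "rt", encoding="utf-8") as handle:
            for line in handle:
                if not line.strip():
                    continue  # skip blank lines
                record = json.loads(line)
                index[record["url"]] = record
    return index


def load_raw_python_index(root="python/final/jsonl"):
    """Index every *.jsonl.gz file below root (assumed directory layout)."""
    return build_url_index(sorted(Path(root).rglob("*.jsonl.gz")))


def restore_original(adv_example, index):
    """Return the raw record sharing the AdvTest example's URL, or None."""
    return index.get(adv_example["url"])
```

After unzipping python.zip, something like index = load_raw_python_index() followed by restore_original(example, index) for each AdvTest example should return the matching raw record, or None when no record has that URL.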