tech-srl / code2vec

TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"
https://code2vec.org
MIT License
1.09k stars 286 forks source link

Vocabulary of code2vec in dataset #128

Open anon721702 opened 3 years ago

anon721702 commented 3 years ago

Dear Sir,

urialon commented 3 years ago

Hi @anon721702 , Thank you for your interest in code2vec, and sorry for the delayed response.

You can reproduce the way that our code opens this file: https://github.com/tech-srl/code2vec/blob/master/vocabularies.py#L75 to see its content.

You can see how this file was created here: https://github.com/tech-srl/code2vec/blob/master/preprocess.py#L16

Best, Uri

urialon commented 3 years ago

By the way, See our newer code2seq model's demo and code.