how to process my own data?

tech-srl / code2seq

Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"

MIT License

556 stars 164 forks source link

Hi, Thank you for your interest in code2seq!

For the format see a description here: https://github.com/tech-srl/code2seq/blob/master/README.md#extending-to-other-languages

Alternatively, you can simply run preprocessing on a small directory (e.g., you can run preprocessing on the JavaExtractor code itself) and see the format.

The TRAIN_DIR is the source of the data, not the target. The data is generated at "data/${dataset_name}/" dir. See also the comment on the top of "preprocess.sh".

Best, Uri

tech-srl / code2seq

how to process my own data? #27