-
## Training ##
- 50/50 split (logs/no logs)
- Assume Apache Hadoop as ground truth.
- Parse with AST
- 2000- 4000 samples
## Validation ##
- "honest" to ground truth
- probably 90/10 split
…
-
Thanks for opening source this, it's great work.
I have been trying to run it on another dataset, specifically the `java-small` dataset from your code2seq work which I found at https://urialon.cswp…
-
Hi, I tried to run the model using your example. However if I run `python3 code2vec.py --load models/java14_model/saved_model_iter8 --predict` I get the error:
`Loading word frequencies dictionarie…
-
Hi, am I right that I can use ur NN for finding clones with similar semantics via vectors distance?
Thx for the answer.
-
On downloading the trained model, I get dictionaries.bin. However, on training the network with preprocessed data from https://s3.amazonaws.com/code2vec/data/java14m_data.tar.gz, I'm unable to generat…
-
# Next paper candidates
Let's propose papers to study next! All papers mentioned in the comments of this issue will be listed in the next vote.
## Last session runner-up(s)
-
-
-
Hello, I'm interested in this project.
But when I try to evaluate your network through your java14m dataset, there is a small error in the Dataset.map function in the self._create_dataset_pipeline. T…
-
Hello I really wish to adapt your code to C/C++
Do you have any idea to do this, I really appreciate you help
I look forward to hearing from you
-
Given a python piece of code:
```
print("hello")
```
How can I extract its associated UAST contexts paths?, I tried to explore `sourced` library and see if there's a function for extracting t…
-
When I try to run the training with tf2.0 in docker with gpu support, I get the following error:
```docker run --runtime=nvidia -it -v $(realpath ~/Code):/code -u $(id -u):$(id -g) tensorflow/tenso…