faysalhossain2007 closed 3 years ago
Hi @faysalhossain2007, sorry for the delayed response.
Q1: If I understand your question correctly, code2vec and an LSTM can each capture the context on their own.
Q2: That's a good question, and unfortunately I don't have an answer for it. One possible way would be to write a new Extractor (according to the format here) and use an AST format that fits multiple languages, like GitHub Semantic.
Best, Uri
Closing due to inactivity, but feel free to re-open if you have additional questions.
Q1: We want to collect the code context. For example, one part of the function gives us a sorted list while another part gives us a reversed list. If I want to capture this context, does the following approach seem reasonable: use the vectors generated by code2vec, train my own LSTM model on labeled data, and evaluate it on the test data?
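To make the workflow in Q1 concrete, here is a minimal sketch of the train/evaluate loop, not a definitive implementation. Everything in it is an assumption for illustration: the vectors are synthetic stand-ins for code2vec output (`make_fake_vectors` is hypothetical), the labels "sorted" and "reversed" mirror the example above, and a nearest-centroid classifier is swapped in for the LSTM so the sketch runs with only the standard library.

```python
import random
from statistics import fmean


def make_fake_vectors(n_per_class, dim, seed=0):
    """Hypothetical stand-in for code2vec output: (vector, label) pairs."""
    rng = random.Random(seed)
    data = []
    for label, center in (("sorted", 1.0), ("reversed", -1.0)):
        for _ in range(n_per_class):
            vec = [center + rng.gauss(0.0, 0.1) for _ in range(dim)]
            data.append((vec, label))
    rng.shuffle(data)
    return data


def split(data, test_fraction=0.25):
    """Hold out the last test_fraction of the (already shuffled) data."""
    cut = int(len(data) * (1 - test_fraction))
    return data[:cut], data[cut:]


def fit_centroids(train):
    """Placeholder for training the LSTM: average the vectors of each label."""
    by_label = {}
    for vec, label in train:
        by_label.setdefault(label, []).append(vec)
    return {
        label: [fmean(column) for column in zip(*vecs)]
        for label, vecs in by_label.items()
    }


def predict(centroids, vec):
    """Return the label whose centroid is closest to vec (squared distance)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: sq_dist(centroids[label], vec))


def accuracy(centroids, test):
    """Fraction of held-out examples that are labeled correctly."""
    correct = sum(predict(centroids, vec) == label for vec, label in test)
    return correct / len(test)


data = make_fake_vectors(n_per_class=20, dim=8)
train, test = split(data)
centroids = fit_centroids(train)
test_accuracy = accuracy(centroids, test)
print(f"test accuracy: {test_accuracy:.2f}")
```

With real data, the synthetic vectors would be replaced by the vectors exported from code2vec, and `fit_centroids`/`predict` by the LSTM's training and inference steps; the split and the accuracy computation stay the same.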
Q2: As my dataset contains more than one programming language, do you have any suggestions on the best way to combine the embedding vectors?
Thanks for the help! I appreciate you making your tool publicly available.