Closed ghost closed 2 years ago
@yuewang-cuhk hi yue could you help on this ? really thanks.
Hi @runningmq , sorry for the late response. It seems your replicated inference script might have some discrepancy with the one we use. I would suggest that you can try to employ our released script to reproduce the results. Please refer to here for more details.
hi,
for c# java translation task. I see the code_bleu is not reported in the paper. could you share the scores ? or can the translation result can be published ? The code bleu score is important for this task.
Thanks
I downloaded the released model and run inference on java-C# translation task. I got the result as below which not matched in the raw paper:
cs to java translation
scores
BLEU: 77.79 ngram match: 0.7778875426766637, weighted ngram match: 0.7859241463045725, syntax_match: 0.9075318329182916, dataflow_match: 0.9004485422377274 CodeBLEU score: 0.8429480160343139 EM: 0.649, = 649/1000
java to cs translation
BLEU: 81.57 ngram match: 0.8157761914953569, weighted ngram match: 0.827130874395443, syntax_match: 0.8968348170128586, dataflow_match: 0.9094303577631122 CodeBLEU score: 0.8622930601666927 EM: 0.618, = 618/1000