microsoft / CodeXGLUE

CodeXGLUE
MIT License
1.5k stars 363 forks source link

Code Summarization : What is goldfile or reference file in BLEU.py? #147

Open ChiragM-Hexaware opened 1 year ago

ChiragM-Hexaware commented 1 year ago

I was trying to execute the code on Kaggle and saw the reference file's path is passed through the command line arguments. I am unable to find that file plus the path for it. Can you please tell me which file I have to be passed there, It comes up as gold file further

celbree commented 1 year ago

After preprocessing the dataset, you will get .jsonl files. You can extract all the docstrings as reference in this format https://github.com/microsoft/CodeXGLUE/blob/main/Code-Text/code-to-text/evaluator/reference.txt. Then run evaluator.py to do evaluation.