Intelligent-CAT-Lab / PLTranslationEmpirical

Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In Proceedings of The 46th IEEE/ACM International Conference on Software Engineering (ICSE 2024), Lisbon, Portugal, April 2024
https://zenodo.org/doi/10.5281/zenodo.8190051
MIT License
37 stars 8 forks source link

Query about the cleaning code #3

Open TheMarsDescends opened 2 months ago

TheMarsDescends commented 2 months ago

Hey~, I got some problems with the cleaning of generation codes. It would be appreciated if you could help me out. After I used clean_generations.py code to clean the generated translations of StarCoder, I found the quality of cleaning very poor and the codes can not achieve the results in the Artifacts RQ1 . Are there any other post-processing techniques to use before testing the translation performances?

alibrahimzada commented 1 month ago

Can you please give more details? Did you use the cleaning script on our starcoder generated translations? Have you checked the starcoder translations from artifacts website on zenodo?