Open sxthunder opened 1 month ago
Hi @sxthunder ,
For the RepoEval scores reported in the paper, we used the second implementation, which is taken from https://github.com/microsoft/CodeT/blob/35f54d60b152cc31d134b788e702878ad613d9f7/RepoCoder/compute_score.py#L23-L33.
Your work has significantly contributed to the field, and I am currently engaging with the RepoEval dataset to further my understanding based on your research.
While exploring the codebase associated with the paper, I came across two separate functions for calculating ES scores. In my experiments, the two functions return different ES scores, which led to some confusion. To make sure my results are comparable with those in your study, could you tell me which function was used to report the results in your publication?
Function 1:
Function 2:
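For anyone who hits the same discrepancy, here is a minimal sketch of how two reasonable "edit similarity" definitions can disagree on the same pair of strings. It does not reproduce either function referred to above. The names `es_levenshtein` and `es_ratio` are hypothetical, and the two normalizations are assumptions chosen for illustration: one divides the Levenshtein distance by the longer string's length, the other uses the matching-block ratio from Python's standard `difflib`.

```python
from difflib import SequenceMatcher


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def es_levenshtein(target: str, prediction: str) -> float:
    """Hypothetical variant A: 1 - distance / length of the longer string."""
    longest = max(len(target), len(prediction))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(target, prediction) / longest


def es_ratio(target: str, prediction: str) -> float:
    """Hypothetical variant B: 2 * matched characters / total characters."""
    return SequenceMatcher(None, target, prediction).ratio()


if __name__ == "__main__":
    target = "x = foo(a)"
    prediction = "x = foo(a, b)"
    print(es_levenshtein(target, prediction))
    print(es_ratio(target, prediction))
```

Both variants return 1.0 for identical strings, but they diverge as soon as the strings differ in length, so scores computed with one are not directly comparable to scores computed with the other.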