Closed LeeSureman closed 3 years ago
No, we actually didn't tune hyper-parameters individually for the languages. So, a slight difference is expected. I think we should run an experiment multiple times (~3) with different seed values and report the average performance.
Hi, I read your work and I think it's very nice. In recent I run the code summarization experiment follow your readme as follow: bash run.sh 1 [pl_lang] and I got the performance table as follow:
I notice the result of plbart on some pl like "Go" seems lower than paper reported result. Does every programming language have its own hyper-parameters on the task?