Open carlos-gemmell opened 4 years ago
Sorry for the late reply. The 30.69 result you get is correct as is shown in the paper (without reranking)
To perform reranking, follow this part https://github.com/neulab/external-knowledge-codegen#reranking
Thanks!
Hi
I am trying to reproduce the numbers stated in the paper for appropriate comparisons to a paper I am writing. But when I run the following command I get a corpus BLEU score of 30.69.
I am guessing the reranker is not used in the generation of the results.
To solve this I accessed the testing function directly to generate hyps and evaluate them with the same BLEU functions.
This gives me a similar corpus BLEU score of 30.078 and an average sentence BLEU score with NLTK with smooth_fn3 of 25.295.
What are the necessary commands in sequence to get the score from the paper?