Closed yuhui-zh15 closed 3 years ago
Hi Yuhui, thanks for your questions! The base value is 498, we had to ignore two articles in the end, our annotators found they were in Gaelic and left them without assessing them.
Thanks for pointing this out. We missed to provide span ids, we needed that to count "at least one word that was annotated by all three annotators". I will address these issues and also release the script to estimate our scores in a couple of days.
Hi Shashi,
I was wondering if you had an update on this as I was hoping to replicate the faithfulness correlations in Table 4. It looks like there are three articles in Gaelic with ids 39553812, 39497668, and 40254741. Thanks!
Hi Alex, Sorry for the delay on this! I did not get slots to work on this properly. I will be adding this code soon. Please bear with me. Thanks!
Hello, awesome works, and congratulations! I'm wondering how to reproduce the numbers in Table 2.
First, I suppose the base number for Table 2 is 500, but multiply 500 with many percentages in Table 2 will result in decimals (e.g., the number of faithful summaries produced by BERTS2S = 500 * 26.9% = 134.5?). Can you explain the base number of this table?
Besides, I write a script following the instructions in Table 2:
However, the results seem to be different from Table 2... I'm not sure which part I misunderstood, could you provide your script?
Looking forward to your reply.
Many thanks, Yuhui
My script:
My results (here I treat I, E, I+E to three orthogonal categories and the I∪E in the Table should be the sum of I, E, I+E: