Closed jarheadjoe closed 2 months ago
When using Acc in dureader, gpt-3.5-16k's score is very low, this is my self test. This is inconsistent with the results in the appendix of your paper
Thank you for your suggestion. We have changed the metric to rouge instead in the updated version.
When using Acc in dureader, gpt-3.5-16k's score is very low, this is my self test. This is inconsistent with the results in the appendix of your paper