Closed tomtang110 closed 5 years ago
Hi, are you sure you're using our script on a complete output file (an example of predictions on the dev set can be found on the website)? Our script should print out a JSON object containing various metrics, and your output looks very different from it.
Hi I run my results in the eval() in hotpot_evaluate_v1.py, however, the result may be not the same with your scores in leaderboard. Could you tell me the correct function to evaluate?