Hi, thanks for opening the dataset. I wonder that how is the evaluation performed? For example , how many candidate phrases would be there for one retrieval since you mentioned it in your paper (sec 3.4) and how to select the candidates. Maybe I missed it, could you give more information or the example code about the evaluation pipeline ?
Hi, thanks for opening the dataset. I wonder that how is the evaluation performed? For example , how many candidate phrases would be there for one retrieval since you mentioned it in your paper (sec 3.4) and how to select the candidates. Maybe I missed it, could you give more information or the example code about the evaluation pipeline ?