I'm trying to fuse 5 CSV files generated by 5 ranking models. After reading the documentation (example 6), I still don't know how to do it. Could you please help me with it?
For example, file model_1.csv:
search_id,item_id
1,45
1,3
1,4
1,90
2,5
2,54
2,76
and file model_2.csv:
search_id,item_id
1,45
1,4
1,3
1,78
2,5
2,93
2,54
Note: different models may return a different set of item_ids for a given search (e.g., item_id 90 appears in model_1 for search_id 1, but not in model_2 for search_id 1). Every model also has a validation score (NDCG). Does the validation score help? How should I use it (as a weight, maybe)?
Could you provide some example code showing how to achieve this (how to read the CSV files as a TrecRun and fuse them, using the validation score as a weight if possible)? Also, what kind of fusion is appropriate for this kind of task? (It's about fusing the rankings of hotel search results.)
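To make the question concrete, here is a minimal sketch of the kind of fusion I have in mind: weighted reciprocal rank fusion (RRF), where each model contributes `weight / (k + rank)` for every item it returns. It is plain Python and does not use the library's API. The helper names `read_ranking` and `weighted_rrf` are hypothetical, the weights 0.40 and 0.35 are made-up stand-ins for the NDCG validation scores, and `k=60` is just the commonly used RRF constant. It also assumes the row order within each search_id is the rank.

```python
import csv
import io
from collections import defaultdict

# Sample data from this post, inlined so the sketch is self-contained.
MODEL_1 = "search_id,item_id\n1,45\n1,3\n1,4\n1,90\n2,5\n2,54\n2,76\n"
MODEL_2 = "search_id,item_id\n1,45\n1,4\n1,3\n1,78\n2,5\n2,93\n2,54\n"


def read_ranking(csv_text):
    """Map each search_id to its item_ids, in file order (assumed to be rank order)."""
    ranking = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        ranking[row["search_id"]].append(row["item_id"])
    return ranking


def weighted_rrf(rankings, weights, k=60):
    """Fuse rankings: each model adds weight / (k + rank) to an item's score."""
    search_ids = set()
    for ranking in rankings:
        search_ids.update(ranking)

    fused = {}
    for sid in search_ids:
        scores = defaultdict(float)
        for ranking, weight in zip(rankings, weights):
            # Items a model did not return simply get no contribution from it.
            for rank, item in enumerate(ranking.get(sid, []), start=1):
                scores[item] += weight / (k + rank)
        # Highest score first; ties broken by item_id for determinism.
        fused[sid] = sorted(scores, key=lambda item: (-scores[item], item))
    return fused


rankings = [read_ranking(MODEL_1), read_ranking(MODEL_2)]
weights = [0.40, 0.35]  # hypothetical NDCG validation scores, used as weights
fused = weighted_rrf(rankings, weights)

for sid in sorted(fused):
    print(sid, fused[sid])
```

With real files, `read_ranking` could be fed `open("model_1.csv").read()` instead of the inlined strings. Items returned by only one model (such as 90 and 78 for search_id 1) still appear in the fused list, just with a lower score.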
Great library, thanks for all the excellent work!