fani-lab / RePair

Extensible and Configurable Toolkit for Query Refinement Gold Standard Generation Using Transformers
5 stars 5 forks source link

Rag Fusion #51

Open DelaramRajaei opened 4 months ago

DelaramRajaei commented 4 months ago

This is the issue where I log the progress of reg fusion in RePair project.

DelaramRajaei commented 4 months ago

Hello @hosseinfani,

I wanted to provide you with an update on the project.

I've uploaded all the recent code updates to the NQLB branch on GitHub. You can find them here.

Additionally, I've included the results from the robust04 datasets on rag fusion. You can access them through this link.

The comparison column is based on different categories:

The analysis covers two IR rankers (BM25 and QLD) and evaluation metrics (MAP, MRR, NDCG).

The "#" indicates the number of refined queries that achieved the best evaluation score within each category of refiners.

Currently, I'm experimenting with other datasets. In the meantime, I'm considering focusing on comparing the original evaluation results with the rag fusion in each category.

I've also created a chart to visualize the results. I would appreciate your suggestions on how to improve it further.

hosseinfani commented 3 months ago

Hi @DelaramRajaei , Thanks for the update and the code merge!

Looking at the excel sheet, I'm not sure I could understood the comparison. Are you available today (Thursday) to have a quick meeting?

Just a quick request: your results are spread over different gsheets, right? Would you please come up with a better management of the result files? We'll talk :)

DelaramRajaei commented 3 months ago

Hi @hosseinfani,

Yes, I am available online today, at any time.

I've been having trouble finding a good solution for the chart. As for the tables, each Google Sheet represents a dataset. Another way to compare them can be by comparing datasets that have the same ranker.metric values.