-
[ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT](https://dl.acm.org/doi/10.1145/3397271.3401075)
## Abstract
Recent progress in Natural Language Un…
-
Build a model for ranking clarifying questions given an instruction.
See [What to ask](https://www.aicrowd.com/challenges/neurips-2022-iglu-challenge/problems/neurips-2022-iglu-challenge-nlp-task#eva…
-
Per discussion here: https://github.com/microsoft/MSMARCO-Passage-Ranking/commit/4695a71c6c76ce85c07a51c0f12690cab19abbb0
The current version of `qidpidtriples.train.full.2.tsv.gz` has the same rec…
-
Hello,
I'm trying to use colbert.text_scorer() to do the rerank in the pipeline, but it seems that there is no option for me to include the metadata, and the output of `colbert.text_scorer()` only…
-
I noticed that there's a sizeable number of passages in the v2 corpus that have text that exactly matches other passages: ~27.8 million passages, which amounts to around 20% of all passages in the cor…
-
[Baselines](https://github.com/microsoft/MSMARCO-Passage-Ranking/tree/master/Baselines)/[data](https://github.com/microsoft/MSMARCO-Passage-Ranking/tree/master/Baselines/data)
/downloaddata.sh
the…
-
Hi~I am trying to reproduce the results of RepLLaMA. I have an a800 GPU. If I start training RepLLaMA from scratch with your code, it may take 80 hours? I want to know if this is normal? If possible, …
-
And can you provide a eval code that reproduces these cross Encoder MRR@10 results on MS Marco Dev?
https://sbert.net/docs/pretrained-models/ce-msmarco.html
![image](https://user-images.githubuserco…
-
Hi @potsawee , thanks for sharing your awesome work.
However, when trying to run your code, I found that even though there is an n-gram model, there are no examples of its usage provided. The n-gra…
-
Hello, have you ever run CEDR_KNRM on MSMARCO document ranking task?
I encountered some problems when I trained CEDR_KNRM initialized with the fine-tuned BERT (the performance almost no longer increa…