-
when i run get_data.sh, i get below error.
./get_data.sh
--2024-02-05 15:14:36-- https://rocketqa.bj.bcebos.com/corpus/marco.tar.gz
Resolving rocketqa.bj.bcebos.com (rocketqa.bj.bcebos.com)... …
-
### Title
Document Expansion by Query Prediction
### Team Name
The Indexers
### Email
keyurdhanani456@gmail.com
### Team Member 1 Name
Keyur Dhanani
### Team Member 1 Id
202…
-
I have been using pre-trained cross-encoder/ms-marco-MiniLM-L-6-v2 on a dataset similar to MS-MARCO for re-ranking paragraphs based on a query/question. The top-3 accuracy results have been pretty goo…
-
## 一言でいうと
Microsoftが公開した質問応答のデータセット(10万件)。質問/回答が、人間のものである点が特徴(Bing=検索エンジンへの入力なのでどこまで質問っぽいかは要確認)。回答はBingの検索結果から抜粋して作成。
### 論文リンク
https://arxiv.org/pdf/1611.09268v1.pdf
### 著者/所属機関
Tri Ng…
-
Evaluating on MS-MARCO seems to take significantly a lot more time than NQ or Hotpot QA, i.e., it just hangs there:
> Loading checkpoint shards: 0%| | 0/2 [00:00
-
### System Info
py3.10
infinity-emb 0.0.55
Running with optimum engine fails:
```
INFO 2024-09-13 15:17:02,874 datasets INFO: PyTorch version 2.4.0 available. …
rawsh updated
1 month ago
-
Hi,
what would be the best way to finetune biobert for sentence embeddings?
Will training biobert on STS/msmarco datasets be a good approach to get domain specific sentence embeddings?
Thanks.
-
### Title
Document Ranking with a Pretrained Sequence-to-Sequence Model
### Team Name
Team DSSM
### Email
202311022@daiict.ac.in
### Team Member 1 Name
Pratham Patel
### Team M…
-
@x-tabdeveloping is working on the new leaderboard [here](https://github.com/embeddings-benchmark/mteb/pull/1235) with awesome progress towards making it customizable (e.g. "select your own benchmark"…
-
hi, thank you very much for your work. I would like to inquire whether the training dataset, MS MARCO, is available for provision, or if you could possibly provide a download link for it.