-
Hey team,
I am looking for ways to speed up a cross-encoder (`cross-encoder/ms-marco-MiniLM-L-6-v2`) on a CPU. For a variety of reasons, GPUs are off the table for now.
We have the following setup:…
-
Thanks for the nice work!
I get the following error in a fresh environment. Something appears to be going wrong in AllenNLP, but I couldn't get to the bottom of it with a bit of searching an…
-
Hi,
I really like your project as it provides an easy-to-use approach. Since the new Llama 3.1 is multilingual, I have been wondering whether this approach could also be used for multilingual data. As we are on…
-
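When I cd into the dataprocess directory and run `bash ./run.sh`, quite a few files appear to be missing. Could you explain why?
The missing files are as follows (under dataprocess):
gen_dynamic_centrality_weight.py reports that ./weights/bert.txt is missing; the input data for this script is demo.json
gen_dynamic_distance_weight.py will…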
-
# BEIR
BEIR is a benchmark for information retrieval. It was used to benchmark Azure AI Search on the Microsoft Tech Community - [Azure AI Search: Outperforming vector search with hybrid retrieval and…
-
**Nov 11**
* [ ] support new features on cloud https://github.com/lancedb/sophon/pull/2783 https://github.com/lancedb/sophon/pull/2765
* [ ] Document how to configure, debug, and update indices http…
-
Hello, thank you for sharing your awesome work.
I have a question related to arguments in denoising.py.
python denoising.py \
--task_name TASK_NAME \
--train_file PATH_TO_TRAIN_FILE \
…
-
Hi everyone!
I wanted to know the exact training procedure/script for training a **Japanese** bi-encoder for asymmetric search.
I am planning to use the translated version of MS MARCO: https:/…
-
As the note in MultipleNegativesRankingLoss.py says:
"You can also provide one or multiple hard negatives per anchor-positive pair by structuring the data like this:
(a_1, p_1, n_1), (a_2, p_…
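
To make the quoted data layout concrete, here is a minimal pure-Python sketch of how I understand a multiple-negatives ranking loss to treat (anchor, positive, hard negative) triplets. Each anchor is scored against every positive and every hard negative in the batch, and the loss is the cross-entropy of picking its own positive. The cosine similarity, the `scale=20.0` factor, the helper names, and the toy 2-d vectors are all assumptions for illustration, not the library's actual code.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def mnr_loss(anchors, positives, hard_negatives, scale=20.0):
    """Mean cross-entropy over anchors (hypothetical helper).

    The candidates for anchor i are all positives followed by all hard
    negatives in the batch; the correct candidate is positives[i].
    scale=20.0 is an assumed value.
    """
    candidates = positives + hard_negatives
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [scale * cosine(anchor, c) for c in candidates]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
        # Cross-entropy with the anchor's own positive as the target.
        total += log_sum_exp - logits[i]
    return total / len(anchors)


# Toy batch of two triplets (a_1, p_1, n_1), (a_2, p_2, n_2); values are made up.
anchors = [[1.0, 0.0], [0.0, 1.0]]
positives = [[0.9, 0.1], [0.1, 0.9]]
hard_negatives = [[0.5, 0.5], [0.5, 0.5]]

loss_with_hard_negatives = mnr_loss(anchors, positives, hard_negatives)
loss_in_batch_only = mnr_loss(anchors, positives, [])
```

In this sketch the hard negatives only add extra candidates to the softmax denominator, so for the same embeddings the loss with hard negatives is always larger than the loss computed with in-batch negatives alone.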
-
Hi,
1/ Is it possible to fine-tune a multilingual bi-encoder on specific domain data using unsupervised synthetic query generation? If yes, is the performance comparable to fine-tuning with supervised …