-
Hi,
For the warm-up step, I see a regular dense retrieval model training on the triples.small data provided by MSMarco.
But I don't find any code introducing bm25 index and bm25 sampling.
I gue…
-
When trying to load the pretrained ESIM model for sentence retrieval I get the following error:
```
Exception has occurred: NotFoundError
Key encode_rnn/birnn/bidirectional_rnn/fw/basic_lstm_cell…
-
作者你好,最近在复现你们的工作“Making Large Language Models A Better Foundation For Dense Retrieval”,但是在模型训练过程中发现了模型塌缩,loss降了5个点后就不降了,同时对所有句子编码后的embedding,计算相似度几乎为1。想问一下在处理ebar和ebae两个任务的label的时候是否进行了一些特殊处理呢?我的理解是句子中…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Using llama-index-vector-stores-elasticsearch, with index, how to enable hybrid search?
…
-
Here is the Google Colab link I used for fine-tuning :
[https://colab.research.google.com/drive/1kiALBR1UarPobiftZmiHfwFyk7hTCDnV?usp=sharing](url)
When I fine-tune the LLM-embed for tool retriev…
-
Hello! I am trying to build a recommendation system using tfrs. I managed to build a retrieval model using the [Retrieval with Sequential Model](https://www.tensorflow.org/recommenders/examples/sequen…
-
想问下,我在用evaluate的过程中,先执行的对bge-m3的dense retrieval的评估,然后pooling_method不管选mean还是cls,recall20 100的分数都是0.0,非常奇怪,参数没有修改,但是用了自己的qa数据集
1. Generate Corpus Embedding
python step0-generate_embedding.py --enco…
-
# Environments
Python: 3.9
OS: Ubuntu 20.04
FlagEmbedding 1.2.5
transformers 4.33.1
# Details
my test python file `bge-test.py`:
```
from FlagEmbedding import BGEM3F…
-
i get such error
Format error in JSON body: invalid type: string "10979dc7e0-9895-4b97-90d8-e7a0beb21e0b", expected usize at line 1 column 157
i make
`
text_id = str(uuid.uuid4…
-
How does the performance of this fine-tuned model on Zero-shot Classification and Zero-shot Cross-Modal Retrieval?