-
``` python
from rank_bm25 import BM25Okapi
corpus = [
"Hello there good man!",
"It is quite windy in London",
"How is the weather today?"
]
tokenized_corpus = [doc.split(" ") for…
-
Following https://github.com/quickwit-oss/search-benchmark-game/issues/45, we might want to update the BM25 constants to k1=0.9 and b=0.4.
I don't think we want to make it configurable yet.
(We h…
-
Hello!
Thanks for RAGbuilder, it's a very nice project.
I am using the custom RAG configuration, but even if I explicitly mention that I only want to use "Vector DB - Similarity Search" as a retri…
-
Q1: The paper says the BM25 retriever is the initial model. Do you mean that the cross-encoder is used to tune the BM25 retriever?
Q2: In Section 4.2, what is the function of s(x,y,xi,yi)? In Section 4.3, what's the…
-
`rank-bm25==0.2.2`
```
In [11]: rank_bm25.BM25(corpus=[])
---------------------------------------------------------------------------
ZeroDivisionError Traceback (most re…
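
The traceback is cut off, so the failing line is not visible here. One plausible cause, stated as an assumption rather than a reading of the rank_bm25 source, is that BM25 needs the average document length, and computing it as total tokens divided by the number of documents fails when the corpus is empty. The sketch below uses two hypothetical helpers, `average_doc_length` and `safe_average_doc_length`, that are not part of rank_bm25; they only illustrate the failure mode and one possible guard.

```python
from typing import List, Sequence


def average_doc_length(tokenized_corpus: Sequence[List[str]]) -> float:
    """Naive average token count; raises ZeroDivisionError on an empty corpus."""
    total_tokens = sum(len(doc) for doc in tokenized_corpus)
    # With no documents, len(tokenized_corpus) is 0 and the division fails.
    return total_tokens / len(tokenized_corpus)


def safe_average_doc_length(tokenized_corpus: Sequence[List[str]]) -> float:
    """Guarded variant (hypothetical): reject an empty corpus with a clear error."""
    if not tokenized_corpus:
        raise ValueError("corpus must contain at least one document")
    return average_doc_length(tokenized_corpus)
```

Raising a `ValueError` up front is only one design option; returning an index that yields no results for every query would be another, and which is preferable depends on how callers are expected to handle an empty corpus.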
-
Thanks for this awesome library.
I am curious to know whether rank_bm25 can handle 500K documents. Each document has around 1000 words.
Looking forward to your feedback. I want to use the follow…
-
### Describe your feature request
Currently, performing a search with `indexSearchable` disabled causes Weaviate to return no results.
It would be clearer if it threw an error instead, as returni…
-
What's the recommended way of performing BM25 ranking search with sqlite-utils? I found a mention of `rank_bm25()` in the [docs](https://sqlite-utils.datasette.io/en/stable/python-api.html#building-sq…
-
### Feature Description
It seems there is no way to add metadata filters when initializing a BM25Retriever object. I am wondering if it would be possible to add this feature.
### Reason
No idea why…