-
Hi
I'd like to suggest adding the option of document-level covariate (similar to STM in R). Basically it allows the user to investigate relationship between document-level covariates such as source…
-
@quevon24 can you take this and fill in the gaps for me. I want to use this issue to both understand the mismatches you found in the matched columbia file, and so I can diagnose where and how it happ…
-
Hi.
I started using black magic probe (stlink clone converted) some times ago.
The main reason is because for every type of microntroller/manufacturer we have a different programmer/debugger and it…
-
The `llm similar` and `collection.similar()` methods currently implement the slowest brute-force approach.
I want to support faster approaches for this, like [sqlite-vss](https://github.com/asg017/…
-
I am trying to use a custom scoring function instead of the default BM25. For example, I wish to turn off IDF and only use the numerator in the BM25 score. What's the best way to achieve that? Would I…
-
**Description:** Research different similarity metrics (e.g., cosine similarity, Euclidean distance, etc.) for comparing image and text embeddings. Analyze their strengths, weaknesses, and suitability…
-
Hi, I'm trying to reimplement your code DSSM in query-doc screenario. However, I found that your code is not suitable in my screenario, for my dataset is in this form
So I need to construct a mapping…
ylqfp updated
8 years ago
-
Hi,
I am trying zero-shot topic modelling with BERTopic. The following fit_transform ran successfully:
```
topic_model = BERTopic(
embedding_model="thenlper/gte-small",
min…
-
Brahmi-derived and arabic script based orthographies have visual graphemes that look the same but have different underlying code points. Some of these are precomposed and decomposed pairs for which U…
-
**Due date = 10/17/23**