-
### Do you need to file an issue?
- [ ] I have searched the existing issues and this bug is not already filed.
- [ ] My model is hosted on OpenAI or Azure. If not, please look at the "model providers…
-
Is it possible to feed in an image and a text into pali to calculate an image-text similarity score? On the readme, I see the prompt is also being fed into the model
## Upvote & Fund
- We're using …
-
I have been observing low cosine similarity scores for InternVideo2 video embeddings compared to relevant text caption embeddings. In some cases, the scores are even negative. I am not sure if I am mi…
-
Getting
`ApiError: status_code: 401, body: {'detail': 'Not authenticated'}`
despite all variables are set propperly.
Exploring file_retrieve_workflow.ipynb and constantly getting this error. Any…
-
The original run.py saves the model in pytorch_model.bin, which cannot be loaded directly using the code provided in this repository. After replacing line 422 `trainer.save_model()` in training/run.py…
-
Add built in functions to compare texts in popular algorithms:
- Jaro
- Jaro-Winkler
- Levenshtein
- Damerau-Levenshtein
- Hamming Distance (simple i know)
- Cosine Similarity
- ....
-
Hello! I have a few questions and observations regarding the deduplication approach using MinHash in this repository. Specifically, I’m interested in some intuition around handling false positives and…
-
**Methods**
1. [Sanjeev Arora paper](https://openreview.net/pdf?id=SyK00v5xx)
2. sentence BERT
3. tf-idf
4. Jaccard and bigram Jaccard (on the fly)
### For reddit preprocessing
1. [ ] Ma…
-
用的示例代码
```
from FlagEmbedding import FlagAutoModel
import argparse
import json
import random
import numpy as np
import faiss
from tqdm import tqdm
from FlagEmbedding import FlagModel
if …
-
- Word2vec
- GloVe: Global Vectors for Word Representation - https://nlp.stanford.edu/projects/glove/
Find out on google - `text similarity python`