Open rut00 opened 4 months ago
You're using the Cross-Encoder, correct?
No, I am using Asymmetric Semantic Search Bi-encoder.
I see, so you're saying that the cosine similarity for what is python language
and Photosynthesis is the process by which green plants and s...
is as high as the other ones?
Yes. The confidence levels are so similar that I cannot put a threshold level for differentiating them.
Hm what model are you using? I'd recommend switching to a bigger / better one, specifically I'd recommend this one: https://huggingface.co/GritLM/GritLM-7B
I am using this model: SGPT-125M-weightedmean-msmarco-specb-bitfit
and I will try the recommended model.
Hello, I want to create a semantic search functionality. The model is accurate in calculating the similarity between the word synonyms. Here are a few excerpts of the demo: Supposedly, my dataset has the following lines:
And my prompts:
The confidence value for each of the prompts ranges between 0.25 and 0.4. The issue I am facing is, that the model gives the same confidence value for wrong and right prompt outputs as seen in prompt 4 output. I want to show "No results found" if the given word is not in the dataset.
How do I solve this issue and make it more efficient? Thank you in advance.