Usage for semantic search #47
Comments
You're using the Cross-Encoder, correct?
No, I am using the Asymmetric Semantic Search Bi-encoder.
I see, so you're saying that the cosine similarity for
Yes. The confidence scores are so similar that I cannot set a threshold to differentiate them.
Hm, what model are you using? I'd recommend switching to a bigger / better one, specifically this one: https://huggingface.co/GritLM/GritLM-7B
I am using this model:
Hello,
I want to build a semantic search feature. The model is accurate at scoring the similarity between a word and its synonyms. Here are a few excerpts from the demo:
Suppose my dataset has the following lines:
And my prompts:
The confidence value for each prompt ranges between 0.25 and 0.4. The issue I am facing is that the model gives the same confidence value for wrong and right outputs, as seen in the output for prompt 4. I want to show "No results found" if the given word is not in the dataset.
How do I solve this issue and make it more efficient? Thank you in advance.
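To make the desired behaviour concrete, here is a minimal sketch of threshold-based retrieval over precomputed embeddings. It is not taken from any model's API: the function names `cosine_scores` and `search`, the toy 3-dimensional vectors, and the `threshold=0.8` value are all assumptions chosen for illustration. In practice the vectors would come from the bi-encoder, and the threshold would have to be tuned on queries known to be in and out of the dataset.

```python
import numpy as np


def cosine_scores(query_vec, doc_vecs):
    """Cosine similarity between one query vector and each row of doc_vecs."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    return d @ q


def search(query_vec, doc_vecs, docs, threshold=0.8, top_k=3):
    """Return up to top_k (doc, score) pairs scoring at least `threshold`.

    An empty list means nothing cleared the threshold, i.e. "No results found".
    """
    scores = cosine_scores(query_vec, doc_vecs)
    order = np.argsort(-scores)[:top_k]  # indices of the best scores first
    return [(docs[i], float(scores[i])) for i in order if scores[i] >= threshold]


# Toy data (hypothetical): real embeddings would come from the encoder.
docs = ["apple", "banana", "cherry"]
doc_vecs = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])

in_dataset_query = np.array([0.9, 0.1, 0.0])   # points mostly at "apple"
out_of_dataset_query = np.array([1.0, 1.0, 1.0])  # equally far from every doc

for name, q in [("in", in_dataset_query), ("out", out_of_dataset_query)]:
    hits = search(q, doc_vecs, docs)
    print(name, hits if hits else "No results found")
```

A fixed cutoff like this only helps when scores for matching and non-matching queries actually separate. If both fall in the same 0.25 to 0.4 band, as described above, no threshold will split them, and the embeddings themselves (the model, or how queries and documents are encoded) need to change first.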