YJiangcm / PromCSE

Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"
https://arxiv.org/abs/2203.06875v2

The huggingface checkpoints do not pass the sanity check #11

Closed Serbernari closed 1 year ago

Serbernari commented 1 year ago

Hi! I was impressed by the results described in your paper, however when I tried to use your model I got very strange results:

```python
from sentence_transformers import SentenceTransformer, util

sentences = ["How is data encrypted in transit?", "Does your application use a firewall?"]
model = SentenceTransformer("YuxinJiang/sup-promcse-roberta-base")  # also tried the large variant
embeddings = model.encode(sentences)

# Compute cosine-similarities
cosine_scores = util.cos_sim(embeddings[0], embeddings[1])
cosine_scores
# tensor([[0.9787]])
```

The same code with `sentence-transformers/all-MiniLM-L12-v1` gives a far more realistic `tensor([[0.1294]])`.

Could you help me figure out what went wrong here?

YJiangcm commented 1 year ago

Hi, our method freezes all transformer parameters and only tunes the additional soft prompt. The checkpoint "YuxinJiang/sup-promcse-roberta-base" contains the parameters of the frozen roberta-base as well as the parameters of the soft prompts. Directly loading our model checkpoint with sentence_transformers will only load the parameters of roberta-base. You can verify this by running

```python
from sentence_transformers import SentenceTransformer, util

sentences = ["How is data encrypted in transit?", "Does your application use a firewall?"]
model = SentenceTransformer("roberta-base")
embeddings = model.encode(sentences)

# Compute cosine-similarities
cosine_scores = util.cos_sim(embeddings[0], embeddings[1])
cosine_scores
```

and the cosine_scores would also be `tensor([[0.9787]])`.
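
As a further sanity check, you can compare the two models' embeddings directly; this is a minimal sketch, assuming both loads fall back to the same default mean-pooling head (the tolerance value is an arbitrary choice):

```python
# Sketch: if sentence_transformers only restores the roberta-base weights
# from the PromCSE checkpoint, both models should yield (near-)identical
# embeddings for the same input.
import numpy as np
from sentence_transformers import SentenceTransformer

sentence = ["How is data encrypted in transit?"]
emb_promcse = SentenceTransformer("YuxinJiang/sup-promcse-roberta-base").encode(sentence)
emb_roberta = SentenceTransformer("roberta-base").encode(sentence)
print(np.allclose(emb_promcse, emb_roberta, atol=1e-5))  # expected to print True
```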

Though our models do not fit sentence_transformers, we have released an easy-to-use Python package, promcse (https://pypi.org/project/promcse/), which provides functions to:

(1) encode sentences into embedding vectors;
(2) compute cosine similarities between sentences;
(3) given queries, retrieve the top-k semantically similar sentences for each query.
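
For example, here is a minimal sketch of those three functions; the constructor arguments (the pooler name "cls_before_pooler" and the soft-prompt length 16) follow the package README and may differ across versions:

```python
from promcse import PromCSE

# Load the checkpoint together with its soft-prompt parameters
# (pooler and prompt length per the README; adjust if your version differs).
model = PromCSE("YuxinJiang/sup-promcse-roberta-base", "cls_before_pooler", 16)

# (1) encode sentences into embedding vectors
embeddings = model.encode(["How is data encrypted in transit?",
                           "Does your application use a firewall?"])

# (2) compute cosine similarities between sentences
similarity = model.similarity("How is data encrypted in transit?",
                              "Does your application use a firewall?")

# (3) build an index, then retrieve the top-k similar sentences for a query
model.build_index(["A man is playing music.", "A woman is reading a book."])
results = model.search("Someone plays the guitar.")
```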

You can also get a quick start with the Open In Colab notebook.

Thanks a lot for your question! I sincerely hope our work can benefit more people :)