In Semantic Textual Similarity training, you use the STSbenchmark dataset, where each entry has two narratives and a score from 0 to 5 indicating the similarity between them.
I have a large dataset that contains only pairs of narratives, with no scores. The two narratives in each pair are considered to be talking about the same idea (so every pair would score 5 out of 5).
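To make the setup concrete, here is a minimal sketch of the two data shapes. The `Pair` class and all example sentences are made up for illustration; they do not come from STSbenchmark or from my actual data.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Pair:
    """Hypothetical container for one training entry."""
    text_a: str
    text_b: str
    score: Optional[float] = None  # 0 to 5 in STSbenchmark; absent in my data


# STSbenchmark-style entry: two texts plus a graded similarity score
# (illustrative values only).
sts_example = Pair("A man is playing a guitar.", "A person plays an instrument.", 3.8)

# My dataset: only pairs that talk about the same idea, with no score column.
my_pairs = [
    Pair("The engine failed shortly after takeoff.",
         "Engine failure occurred right after departure."),
    Pair("The patient reported severe headaches.",
         "Strong headaches were reported by the patient."),
]

# Assigning the implied label gives every entry the same score.
labeled = [Pair(p.text_a, p.text_b, 5.0) for p in my_pairs]
scores = {p.score for p in labeled}
print(scores)  # a single distinct value, so the score column carries no contrast
```

Since every label is identical, there are no lower-scored pairs for the model to contrast against, which is what the question is about.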
How can I train the model on a dataset where all entries have the score 5?