I try to embed some sentences that have 5-20 number of tokens and have little syntactic differences. Also, I collected a corpus to train for that specific domain.
I think, -loss ns and selecting appropriate threshold of sampling -t to select "negative hard mining samples" are important.
What are the best hyperparameters to use these kind of case which focuses on text similarity?
Hi,
I try to embed some sentences that have 5-20 number of tokens and have little syntactic differences. Also, I collected a corpus to train for that specific domain.
I think,
-loss ns
and selecting appropriate threshold of sampling-t
to select "negative hard mining samples" are important.What are the best hyperparameters to use these kind of case which focuses on text similarity?