Closed omidb closed 8 years ago
After a quick glance at the WordNGramJaccardMeasure, I'd say that it returns zero because there is no common 3-gram between the two sentences you are using as examples.
ah, there is one... sorry. "short example text"
No, I was right ;) There is no common ngram. If you split the first sentence, you get text.
(with a trailing full stop) but in the second sentence, you get text
.
@omidb Maybe the example needs to be fixed. At which URL did you find it?
I found it here: https://dkpro.github.io/dkpro-similarity/
What are the similarity measures that you found work better.
I updated the example code.
There is no way of saying in general which measure works best. It depends on the context of your. For questions, best try the users mailing list: https://groups.google.com/forum/#!forum/dkpro-similarity-users
Hey Omid its Farbod from Germany,
hope you are well last time we saw each other was Hamkafe Bargh MRL Nao, QIAU 💃
I am using Dkpro also, but I don't know about the parameter. trigrams
TextSimilarityMeasure measure = new WordNGramJaccardMeasure(3); // Use word **trigrams**
Also, want to ask you if you know if this is the semantic similarity _ mean it uses ESA to each ten in Wikipedia to extract semantic or not. by the way, I could run it there new maven 2.3
thanks, bro
Hi,
I'm trying to use your example in the website:
The return value is 0.0, am I missing something?