eulerto / pg_similarity

set of functions and operators for executing similarity queries
BSD 3-Clause "New" or "Revised" License

Possible Bug in Dice (and Jaccard) Function #34

Open momenator opened 2 years ago

momenator commented 2 years ago

Hi,

I tried running the dice function on the sample strings

select dice('euler', 'heuser')

and I got 0 instead of 0.444... as specified in the example.

Using another example

select dice('hello there not', 'hello friend what')

It seems that the dice function computes over the words and not the n-grams specified in the comments. Is this the expected behavior?
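
To make the difference concrete, here is a minimal Python sketch of the Dice coefficient, 2·|A ∩ B| / (|A| + |B|), computed over two token sets. It is not the extension's C code: the helper names `word_tokens`, `char_ngrams`, and `dice` are hypothetical, and using unpadded character bigrams is an assumption made for illustration. Under that assumption the bigram version gives 4/9 ≈ 0.444 for the sample strings, while whitespace-delimited words give 0, which matches the two values mentioned above.

```python
def word_tokens(text):
    """Split on whitespace and return the set of lowercase words."""
    return set(text.lower().split())


def char_ngrams(text, n=2):
    """Return the set of character n-grams (no padding; an assumption)."""
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def dice(a, b):
    """Dice coefficient of two token sets: 2*|A & B| / (|A| + |B|)."""
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))


# Word tokens: {'euler'} and {'heuser'} share nothing.
print(dice(word_tokens('euler'), word_tokens('heuser')))            # 0.0

# Bigrams: {eu, ul, le, er} and {he, eu, us, se, er} share {eu, er}.
print(round(dice(char_ngrams('euler'), char_ngrams('heuser')), 3))  # 0.444
```

With word tokens, the second example ('hello there not' against 'hello friend what') shares only `hello`, so this sketch yields 2·1 / (3 + 3) ≈ 0.333. That figure comes from the sketch, not from a run of the extension.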