namhnguyen / asterixdb

Automatically exported from code.google.com/p/asterixdb

Combine set similarity and fuzzy string matching #721

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
We can already compare strings fuzzily, and we can compare sets using Jaccard similarity.

It would be nice to combine the two: use fuzzy string matching to decide which 
strings count as equivalent, then feed those equivalences into the Jaccard 
similarity measure, ideally with the same index support that we have today.
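
To make the request concrete, here is a rough sketch of the semantics I have in mind. It is not AsterixDB code and does not address index support. The names `edit_distance`, `fuzzy_jaccard` and `max_edits` are made up for illustration, the matching is a simple greedy one-to-one pairing (an optimal bipartite matching could give a larger overlap), and treating two empty sets as fully similar is an assumption.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance using the standard two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def fuzzy_jaccard(left, right, max_edits=1):
    """Jaccard similarity where tokens within max_edits count as equal.

    Hypothetical helper: each token is paired with at most one token
    from the other set, chosen greedily by smallest edit distance.
    """
    a, b = set(left), set(right)
    if not a and not b:
        return 1.0  # assumption: two empty sets are identical
    unmatched = set(b)
    overlap = 0
    for token in sorted(a):  # sorted only to make the result deterministic
        best = min(unmatched,
                   key=lambda t: (edit_distance(token, t), t),
                   default=None)
        if best is not None and edit_distance(token, best) <= max_edits:
            unmatched.remove(best)
            overlap += 1
    # Same formula as plain Jaccard, with the fuzzy overlap as intersection size.
    return overlap / (len(a) + len(b) - overlap)


if __name__ == "__main__":
    print(fuzzy_jaccard({"nba", "mcgrady"}, {"nba", "macgrady"}, max_edits=1))
    print(fuzzy_jaccard({"nba", "mcgrady"}, {"nba", "macgrady"}, max_edits=0))
```

With `max_edits=0` this reduces to ordinary Jaccard on exact tokens, so the fuzzy threshold is the only new knob.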

Original issue reported on code.google.com by westm...@gmail.com on 7 Mar 2014 at 6:19

GoogleCodeExporter commented 9 years ago
Here is a paper that studies this problem:
http://tods.acm.org/accepted/2013/WangExtending.pdf

Original comment by icetin...@gmail.com on 22 Apr 2014 at 11:53