idilhaq / duke

Automatically exported from code.google.com/p/duke

Fuzzy search in Lucene #26

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Apparently it's possible to do fast fuzzy searches in Lucene 3.x. Need to find 
out how. Keywords are "ngram index" and "spellcheck". Haven't found anything 
yet, but need to see if there is a way to do this.

Original issue reported on code.google.com by lar...@gmail.com on 24 Aug 2011 at 8:52

GoogleCodeExporter commented 9 years ago
http://blog.mikemccandless.com/2011/03/lucenes-fuzzyquery-is-100-times-faster.html

Original comment by ashwin.j...@gmail.com on 24 Aug 2011 at 10:43

GoogleCodeExporter commented 9 years ago
Hi there. Yes, I'm very much looking forward to this change. Unfortunately, it 
won't be released before Lucene 4.0, so I'm trying to see if there exists a 
solution for 3.0 that I can use in the meantime.

Original comment by lar...@gmail.com on 25 Aug 2011 at 6:29

GoogleCodeExporter commented 9 years ago
Looks like release 3.3 might have some relevant stuff: 
http://lucene.apache.org/#1+July+2011+-+Lucene+Core+3.3+and+Solr+3.3+Available

Original comment by lar...@gmail.com on 25 Aug 2011 at 7:29

GoogleCodeExporter commented 9 years ago
I think this could be merged with issue 84: Lucene 4 has fuzzy search that is 
100 times faster, and it is now GA.

Original comment by yann.bar...@gmail.com on 14 Jan 2013 at 9:32

GoogleCodeExporter commented 9 years ago
Yes and no. First we need to fix issue 84, which would give us 4.0 support. 
Then we can implement this. Will do it as soon as I can. (Patches welcome. :-)

Original comment by lar...@gmail.com on 14 Jan 2013 at 9:36

GoogleCodeExporter commented 9 years ago

Original comment by lar...@gmail.com on 1 Feb 2013 at 6:39

GoogleCodeExporter commented 9 years ago
This is how to do it: 
http://lucene.apache.org/core/4_2_0/core/org/apache/lucene/search/FuzzyQuery.html

Edit distance defaults to 2, which is the max. Shouldn't take too long to code 
this up.

Original comment by lar...@gmail.com on 16 Mar 2013 at 11:39
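
To make "edit distance 2" concrete, here is a minimal plain-Java sketch of what that limit means. It is not Lucene's implementation and not the code that ended up in Duke: Lucene 4's FuzzyQuery is automaton-based and may also treat transpositions as single edits, while this sketch uses textbook Levenshtein distance over an in-memory list. The class name `EditDistanceSketch`, the helper `fuzzyMatches`, and the sample terms are all made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only: hypothetical class, not part of Lucene or Duke.
final class EditDistanceSketch {

    // Mirrors the limit of 2 mentioned in the thread (an assumption of this sketch).
    static final int MAX_EDITS = 2;

    // Textbook Levenshtein distance (insert, delete, substitute),
    // computed with two rolling rows of the dynamic-programming table.
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(
                        Math.min(curr[j - 1] + 1, prev[j] + 1),
                        prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Brute-force scan: keep every term within maxEdits of the query.
    // A real index avoids comparing the query against every term.
    static List<String> fuzzyMatches(String query, List<String> terms, int maxEdits) {
        List<String> hits = new ArrayList<>();
        for (String term : terms) {
            if (distance(query, term) <= maxEdits) {
                hits.add(term);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<String> terms = Arrays.asList("lucene", "lucine", "lusen", "solr");
        System.out.println(fuzzyMatches("lucene", terms, MAX_EDITS));
    }
}
```

A scan like this costs one distance computation per term, which is why doing it naively over a large index is slow; the blog post linked earlier in the thread describes how Lucene 4 avoids that.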

GoogleCodeExporter commented 9 years ago
This is implemented now. Turned out to be dead easy, and slow.

Original comment by lar...@gmail.com on 14 Feb 2014 at 8:10