thegooglecodearchive / allforgood

Automatically exported from code.google.com/p/allforgood

need to refine stop words and custom synonym related to key partners #561

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
The current stop words file includes words like "on", which I would think 
decreases the probability of making a match when searching for "hands on".  
I believe Solr then only looks at the term "hand" and its stemming variants.
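
To illustrate the concern, here is a minimal sketch of a stop word filter followed by a naive stemmer. It is a toy stand-in, not Solr's actual analyzer: the `analyze` function, the stop word sets, and the "strip a trailing s" rule are all assumptions made for the example.

```python
# Toy analysis chain: lowercase, drop stop words, then strip a plural "s".
# Illustration only; this is NOT how Solr's analyzers are implemented.

def analyze(text, stop_words):
    tokens = text.lower().split()
    kept = [t for t in tokens if t not in stop_words]
    # Naive stand-in for a stemmer: strip one trailing "s" from longer words.
    return [t[:-1] if t.endswith("s") and len(t) > 3 else t for t in kept]

# Hypothetical stop word lists, with and without "on".
with_on = analyze("Hands On", stop_words={"on", "for", "the"})
without_on = analyze("Hands On", stop_words={"for", "the"})

print(with_on)     # ['hand']
print(without_on)  # ['hand', 'on']
```

With "on" in the list, the query collapses to the single term "hand", so any document mentioning hands can match about as well as one mentioning "Hands On".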

Original issue reported on code.google.com by danstryk...@gmail.com on 8 Jan 2011 at 6:45

GoogleCodeExporter commented 9 years ago
See recommendation to remove "on" from stopwords to ensure that "Hands On" 
ranks properly
http://code.google.com/p/allforgood/source/browse/trunk/SOLR/app/conf/stopwords.txt?r=471

Original comment by danstryk...@gmail.com on 9 Jan 2011 at 6:32

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 12:26

GoogleCodeExporter commented 9 years ago
It may also make sense to add a custom synonym for "handson" and "hands on".
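
A rough sketch of what such a synonym could do at analysis time: the one-word spelling is rewritten to the two-word form so both spellings produce the same tokens. The `SYNONYMS` table and the function names are hypothetical, and the sketch assumes "on" is no longer removed as a stop word.

```python
# Hypothetical synonym table: a single-token variant maps to a canonical
# token sequence. Illustration only; not Solr's synonym filter.
SYNONYMS = {"handson": ("hands", "on")}

def expand_synonyms(tokens):
    out = []
    for t in tokens:
        # Replace a known variant with its canonical tokens, else keep it.
        out.extend(SYNONYMS.get(t, (t,)))
    return out

def normalize(text):
    return expand_synonyms(text.lower().split())

print(normalize("HandsOn Network"))   # ['hands', 'on', 'network']
print(normalize("Hands On Network"))  # ['hands', 'on', 'network']
```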

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 12:43

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 12:43

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 12:44

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 12:44

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 10 Jan 2011 at 3:44

GoogleCodeExporter commented 9 years ago
It may also be worth removing "for" from the stopwords to increase search 
accuracy for "habitat for humanity".  

It would be good to make sure somehow that "humanity" does NOT match against 
"human". That match causes "human resources" to match against "habitat for humanity".

We should also add a custom synonym for "handson" and "hands on".

Original comment by danstryk...@gmail.com on 11 Jan 2011 at 8:36

GoogleCodeExporter commented 9 years ago

Original comment by danstryk...@gmail.com on 11 Jan 2011 at 8:37

GoogleCodeExporter commented 9 years ago
Moved stopword functionality to Solr. Removed "on" and "for" from the list. I 
also added "humanity" and "human" to the protwords.txt file. This file contains 
a list of words that won't get stemmed, which will stop searches for "human 
resources" from showing "habitat for humanity".
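
As a rough illustration of why protecting these words helps, the sketch below uses a toy suffix-stripping stemmer that skips protected words. The stemmer rules and function names are assumptions for the example and do not reproduce Solr's stemmer; only the two protected words come from this change.

```python
# The two words added to protwords.txt in this change.
PROTECTED = frozenset({"human", "humanity"})

def toy_stem(token, protected=frozenset()):
    # Protected words are passed through unchanged.
    if token in protected:
        return token
    # Toy rules: strip "ity" or a trailing "s" if at least 3 letters remain.
    for suffix in ("ity", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token

def analyze(text, protected=frozenset()):
    return [toy_stem(t, protected) for t in text.lower().split()]

def shared_terms(a, b, protected=frozenset()):
    return set(analyze(a, protected)) & set(analyze(b, protected))

print(shared_terms("habitat for humanity", "human resources"))             # {'human'}
print(shared_terms("habitat for humanity", "human resources", PROTECTED))  # set()
```

Without protection, "humanity" is reduced to "human" and the two phrases share a term; with both words protected, they no longer overlap.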

Original comment by jwdemp...@gmail.com on 20 Jan 2011 at 4:48