OpenSextant / SolrTextTagger

A text tagger based on Lucene / Solr, using FST technology
Apache License 2.0
173 stars 37 forks source link

Field BoostFilter SearchComponent #54

Open dsmiley opened 8 years ago

dsmiley commented 8 years ago

I know that the SolrTextTagger is used by CareerBuilder to find interesting things in a user's query to then do other things (like boost or apply a filter). There is a cool Solr plugin by Ted Dunning at LucidWorks here: https://github.com/lucidworks/query-autofiltering-component that does this... although I have a bunch of concerns with it. Relevant blog: https://lucidworks.com/blog/2015/05/13/query-autofiltering-revisited-can-precise/

I think it would be cool to develop a SearchComponent similar to Ted's but based on the SolrTextTagger. It would build a "side-car index" (possibly held in memory -- configurable) and then use its results to either apply "fq" filter queries or dismax "bq" boost queries (or both). In the end, it should be much less code than Ted's and it should have it's analysis configurable via the Solr schema instead of being hard-coded.

Disclaimer: this is just an idea place-holder; I don't yet have plans to do this

dsmiley commented 8 years ago

What I'm not sure about is the rule/pattern part as described here https://lucidworks.com/blog/2015/11/19/query-autofiltering-chapter-4-a-novel-approach-to-natural-language-processing/ -- need to think about that more. Any way, it's not essential.