Open bsmmoon opened 7 years ago
VSM + 3-gram?
Phrasal queries are 2 or 3 words long at most, so if you want to handle phrasal queries, you can support them using n-word indices or positional indices.
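For comparison with the n-word approach, here is a minimal sketch of the positional-index option. It assumes whitespace tokenization and lowercase matching. The names `build_positional_index` and `phrase_match` are made up for illustration and do not come from the assignment.

```python
from collections import defaultdict

def build_positional_index(docs):
    """Build term -> {doc_id -> [positions]} from a {doc_id: text} mapping."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, tok in enumerate(text.lower().split()):
            index[tok][doc_id].append(pos)
    return index

def phrase_match(index, phrase):
    """Return the doc ids in which the phrase tokens occur at consecutive positions."""
    tokens = phrase.lower().split()
    # An empty phrase or an unseen term cannot match anything.
    if not tokens or any(t not in index for t in tokens):
        return set()
    hits = set()
    for doc_id, starts in index[tokens[0]].items():
        for start in starts:
            # The k-th following token must sit exactly k positions after the first.
            if all(start + k in index[t].get(doc_id, ())
                   for k, t in enumerate(tokens[1:], 1)):
                hits.add(doc_id)
                break
    return hits
```

With a positional index the phrase length does not matter, so a 2-word and a 3-word phrase go through the same lookup. The cost is larger postings and a position check at query time.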
Assuming indexing is done with 3-grams, how do we handle a query with fewer than 3 tokens? Wildcards?
If we use multiple n-gram indexes (1-, 2-, and 3-gram), how do we handle a mixed query like '2 words' AND '3 words'?
Use 1-, 2-, and 3-gram models. Given a phrase of length n, use the n-gram index first. If not enough documents are retrieved, try the lower-order n-grams as well.
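A rough sketch of that back-off idea, under a few assumptions: whitespace tokenization, boolean retrieval only (no VSM scoring), and a hypothetical `min_hits` threshold standing in for "not enough documents". The function names are invented for illustration.

```python
from collections import defaultdict

def build_ngram_indexes(docs, max_n=3):
    """Build one inverted index per n in 1..max_n: n-gram tuple -> set of doc ids."""
    indexes = {n: defaultdict(set) for n in range(1, max_n + 1)}
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        for n in range(1, max_n + 1):
            for i in range(len(tokens) - n + 1):
                indexes[n][tuple(tokens[i:i + n])].add(doc_id)
    return indexes

def lookup_phrase(indexes, phrase, min_hits=1):
    """Query the n-gram index matching the phrase length, then back off to lower n.

    min_hits is an assumed threshold for "enough documents retrieved".
    """
    tokens = phrase.lower().split()
    top = min(len(tokens), max(indexes))  # start at the highest usable order
    results = set()
    for n in range(top, 0, -1):
        grams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        # Keep the documents that contain every n-gram of the phrase.
        results |= set.intersection(*(indexes[n].get(g, set()) for g in grams))
        if len(results) >= min_hits:
            break
    return results
```

Under this scheme a mixed query such as '2 words' AND '3 words' could be answered by looking up each phrase separately (the first starts in the 2-gram index, the second in the 3-gram index) and intersecting the results, e.g. `lookup_phrase(idx, "new york") & lookup_phrase(idx, "york stock exchange")`. Results found only after backing off are weaker matches, so a ranked system would likely want to score them lower.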
It should be..