Closed Stefan4472 closed 2 years ago
Added this option in the AlphanumericTokenizer
. I'm not sure if this is really something we'd want to keep long term, because from some quick research, it may be ineffective: https://nlp.stanford.edu/IR-book/html/htmledition/capitalizationcase-folding-1.html
Look into this. I think queries should be case-insensitive (i.e., store all tokens as lowercase!)