dulibrarytech / digitaldu-frontend

Digital Collections DU front end
Apache License 2.0

Punctuation characters are removed by frontend #405

Closed jrynhart closed 9 months ago

jrynhart commented 9 months ago

Currently, punctuation characters (such as '-' and '.') are removed by the frontend.

The index will not match a string such as an ArchivesSpace identifier "B.002.4321.2411" if the punctuation is removed. There have also been issues with apostrophes in search terms:

From Kate on 5/5/23 (Teams, LTS channel): "I just did a search for men's soccer (no quotes) and got 0 results. I then did a search just for the term soccer, and got 466, most of which have the phrase 'men's soccer' in them. Ideas?"

Is the apostrophe in the indexed value preventing a hit when the search terms do not include an apostrophe? If so, a tokenizer might have to be added or updated in the Elasticsearch index.

  1. Allow search on an ID such as "B.002.4321.2411"
  2. A search for "men's soccer" should return the same results as "mens soccer" and "men soccer"
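
To make the two goals above concrete, here is a rough sketch of the matching behavior being asked for. It is plain Python, not the frontend code and not an Elasticsearch analyzer; the function names (`tokenize`, `index_terms`, `matches`) are hypothetical, and the approach (index each possessive under its apostrophe-free forms as well) is just one possible way to get there.

```python
import re

EDGE_PUNCTUATION = ".,;:!?\"'()[]"


def tokenize(text):
    """Lowercase and split on whitespace.

    Punctuation is stripped only from the edges of each token, so
    characters inside a token ('.', '-', apostrophes) are preserved.
    """
    text = text.lower().replace("\u2019", "'")  # treat curly apostrophes as straight
    tokens = []
    for raw in text.split():
        token = raw.strip(EDGE_PUNCTUATION)
        if token:
            tokens.append(token)
    return tokens


def index_terms(document):
    """Terms stored for a document (assumed index-side expansion).

    A possessive such as "men's" is also stored as "mens" and "men",
    so a query with or without the apostrophe finds the same document.
    """
    terms = set()
    for token in tokenize(document):
        terms.add(token)
        possessive = re.fullmatch(r"(.+)'s", token)
        if possessive:
            stem = possessive.group(1)
            terms.add(stem + "s")  # men's -> mens
            terms.add(stem)        # men's -> men
    return terms


def matches(query, document):
    """True if every query token is among the document's stored terms."""
    terms = index_terms(document)
    return all(token in terms for token in tokenize(query))


if __name__ == "__main__":
    doc = "Men's soccer team photograph, B.002.4321.2411"
    for query in ["B.002.4321.2411", "men's soccer", "mens soccer", "men soccer"]:
        print(query, "->", matches(query, doc))
```

In this sketch all four queries match the sample document, because the identifier is kept as a single token and the possessive is stored under all three spellings. In Elasticsearch the equivalent behavior would come from the analyzer configured on the index, which is the part that still needs to be investigated.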