fpdcc / document-search

Search and management interface for spatial documents at the Forest Preserves of Cook County.
2 stars 0 forks source link

normalizing or correcting data and/or how searching interprets upper and lower case strings #78

Open glw opened 3 years ago

glw commented 3 years ago

Probably both need to be done. Currently terms like "Des Plaines Division Headquarters" and "DES PLAINES DIVISION HEADQUARTERS" are treated as separate terms, when in reality they are the same.

Cleaning of the data (standardizing names, locations, dates, etc.) should be done. and possibly adjustments to how Solr (?) interprets lower and upper case terms/names.

image

smcalilly commented 1 year ago

i think we can fix this by overriding the django-haystack prepare? nope not that easy!

smcalilly commented 1 year ago

there are a few things we need to do for this:

  1. figure out all of the models and model fields that have this issue
  2. one-time data cleanup
  3. add a dropdown for these values so users can't manually enter them
  4. allow admin users to CRUD these values