openculinary / knowledge-graph

The RecipeRadar knowledge graph stores and provides access to recipe and ingredient relationship information.
GNU Affero General Public License v3.0
10 stars 0 forks source link

Handle unicode characters during ingredient name highlighting #42

Closed jayaddison closed 4 years ago

jayaddison commented 4 years ago

Describe the reason for these changes and the problem that they solve

This changeset introduces support for markup highlighting of ingredient names that contain non-ASCII characters. This has surfaced as an issue in openculinary/frontend#162.

Briefly summarize the changes

  1. Upgrade to a version of hashedixsearch that supports highlighting of non-ASCII terms
  2. Perform transliteration during stemming of ingredient terms so that an ASCII version of each ingredient name and query is used to term matching purposes (i.e. allow jalapeno to match against jalapeño)

How have the changes been tested?

  1. The underlying hashedixsearch library's unit test coverage demonstates that this is possible
  2. Unit test coverage is provided
  3. Local development testing has been performed

List any issues that this change relates to Relates to openculinary/hashedixsearch#23 Resolves openculinary/frontend#162