openculinary / knowledge-graph

The RecipeRadar knowledge graph stores and provides access to recipe and ingredient relationship information.
GNU Affero General Public License v3.0

Improve liquid smoke handling #10

Closed jayaddison closed 4 years ago

jayaddison commented 4 years ago

The term smoke has been categorized as a stop-word by the ingredient ingestion process; this changeset adds it to the list of clear-words so that it can be considered a valid product name token.
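To illustrate the stop-word / clear-word interaction described above, here is a minimal sketch. It assumes that clear-words act as an override on the stop-word list; the names `STOP_WORDS`, `CLEAR_WORDS` and `product_tokens`, and the word lists themselves, are hypothetical and not taken from the repository.

```python
# Hypothetical word lists: the real ingestion process may derive these differently.
STOP_WORDS = {"smoke", "fresh", "chopped"}
CLEAR_WORDS = {"smoke"}  # assumption: clear-words override stop-words


def product_tokens(description, clear_words=CLEAR_WORDS):
    """Return the tokens that may form part of a product name."""
    tokens = description.lower().split()
    # Keep a token if it is explicitly cleared, or if it is not a stop-word.
    return [t for t in tokens if t in clear_words or t not in STOP_WORDS]


# With the override in place, "smoke" survives as a product name token.
print(product_tokens("liquid smoke"))
# Without it, the token is dropped and only "liquid" remains.
print(product_tokens("liquid smoke", clear_words=set()))
```

Under these assumptions, adding smoke to the clear-words keeps liquid smoke intact as a product name instead of reducing it to liquid.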

Fixing this highlighted that during product merges (implemented via the Python add operator on the Product class), the winning product name was arbitrary. This change adds a rule that prefers the shorter product description, so that a merge of hickory liquid smoke and liquid smoke picks the latter as the product name for the merged result.
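The merge rule can be sketched roughly as follows. This is an illustrative stand-in rather than the actual Product class: the `frequency` field and the alphabetical tie-break are assumptions added to make the example self-contained and deterministic.

```python
from dataclasses import dataclass


@dataclass
class Product:
    name: str
    frequency: int = 1  # hypothetical field, used only to show that other data is combined

    def __add__(self, other):
        # Prefer the shorter name; break ties alphabetically (an assumption)
        # so that the result does not depend on operand order.
        name = min(self.name, other.name, key=lambda n: (len(n), n))
        return Product(name=name, frequency=self.frequency + other.frequency)


merged = Product("hickory liquid smoke") + Product("liquid smoke")
print(merged.name)
```

Selecting the name with a key of (length, name) means `a + b` and `b + a` yield the same product name, which removes the arbitrariness described above.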

Resolves #7