Quaffel / get-me-drunk-efficiently

Webservice providing cocktail recommendations based on how drunk you want to be
4 stars 0 forks source link

Improve recipe cleansing/pre-processing to mitigate poor data quality #51

Closed Quaffel closed 2 years ago

Quaffel commented 2 years ago

In addition to the data quality issues described in #50, some further problems pop up when carefully inspecting the results.

Quaffel commented 2 years ago

Current state of the investigation:

What catches the eye is that for the duplicate ingredients, the name of the ingredient is not only an ingredient but also a category.