openfoodfacts / openfoodfacts-server

Open Food Facts database, API server and web interface - 🐪🦋 Perl, CSS and JS coders welcome 😊 For helping in Python, see Robotoff or taxonomy-editor
http://openfoodfacts.github.io/openfoodfacts-server/
GNU Affero General Public License v3.0
658 stars 387 forks source link

Dutch "Pecorino Romano" ingredient disappears #4337

Closed svensven closed 4 years ago

svensven commented 4 years ago

This record has "pecorino romano kaas" has an ingredient, but the parser just disappears it, and assigns its subingredients to the previous ingredient. https://nl.openfoodfacts.org/product/23026520/pesto

image

vs

image

https://nl.openfoodfacts.org/cgi/test_ingredients_analysis.pl?ingredients_text=zout%2C+Pecorino+Romano%2C+Pecorino+Romano+kaas%2C+Grana+Padano%2C+Grana+Padano+kaas&type=add&action=process&.submit=Submit+Query

stephanegigandet commented 4 years ago

Added some traces to figure out what's going on:

debug: unknown ingredient matches a phrase to ignore {ingredient => "Pecorino Romano",regexp => "o.a."}

I'm changing the regexp to 'o.a.'