openfoodfacts / openfoodfacts-server

Open Food Facts database, API server and web interface - 🐪🦋 Perl, CSS and JS coders welcome 😊 For helping in Python, see Robotoff or taxonomy-editor
GNU Affero General Public License v3.0
633 stars 371 forks source link

Ingredients lists with missing spaces after dots are not parsed correctly, leading to false positives #4948

Open teolemon opened 3 years ago

teolemon commented 3 years ago

When clicking on https://fr.openfoodfacts.org/ingredient/arome-de-cacao, “de” is removed. As a result, we get products which do not have "arôme de cacao" as an ingredient, but rather "arome. cacao minimum:12%"

Part of

Screenshot

image

stephanegigandet commented 3 years ago

It's a parsing issue related to dots, "arôme.cacao" is not separated. "arôme. cacao" is separated. But it's intended to be that way. We can try to handle ingredients lists with missing spaces after dots, but we will need to be careful not to break something else in the process.

github-actions[bot] commented 7 months ago

This issue has been open 90 days with no activity. Can you give it a little love by linking it to a parent issue, adding relevant labels and projets, creating a mockup if applicable, adding code pointers from https://github.com/openfoodfacts/openfoodfacts-server/blob/main/.github/labeler.yml, giving it a priority, editing the original issue to have a more comprehensive description… Thank you very much for your contribution to 🍊 Open Food Facts