openfoodfacts / openfoodfacts-server

Open Food Facts database, API server and web interface - 🐪🦋 Perl, CSS and JS coders welcome 😊 For helping in Python, see Robotoff or taxonomy-editor
http://openfoodfacts.github.io/openfoodfacts-server/
GNU Affero General Public License v3.0
658 stars 388 forks source link

Avoid duplicates in tags fields #8355

Open alexgarel opened 1 year ago

alexgarel commented 1 year ago

In https://de-en.openfoodfacts.org/product/4008577006186/br%C3%B6tchen-conditorei-coppenrath-wiese-unsere-goldst%C3%BCcke-dinkelbr%C3%B6tchen-6-st%C3%BCck?rev=118 we have a repetition of the brand. This is due to a mobile app.

I think we shall resolve those issues in Product Opener code, at the moment we fill _tags fields.

benbenben2 commented 1 year ago

Is it only for brands?

For duplicates in Brands, we may have around 200 products.

Sometimes, all variations of a brand can be listed (for example, kelloggs, kellogs, kellog's, kelogs, kelog's, etc.). This kind of duplicates could be solved with a taxonomy for Brands. But this is a bit off-topic.

alexgarel commented 1 year ago

@benbenben2 it's potentially for all "tags" fields.

github-actions[bot] commented 1 year ago

This issue has been open 90 days with no activity. Can you give it a little love by linking it to a parent issue, adding relevant labels and projets, creating a mockup if applicable, adding code pointers from https://github.com/openfoodfacts/openfoodfacts-server/blob/main/.github/labeler.yml, giving it a priority, editing the original issue to have a more comprehensive description… Thank you very much for your contribution to 🍊 Open Food Facts