openfoodfacts / openfoodfacts-ai

This is a tracking repo for all our AI projects. 🍕 🤖🍼
212 stars 50 forks source link

feat: Spellcheck benchmark dataset and evaluation algorithm #340

Closed jeremyarancio closed 2 weeks ago

jeremyarancio commented 2 months ago

What

Creation of the benchmark and evaluation algorithm to evaluate the spellcheck

Benchmark

The benchmark is composed of 247 lists of ingredients from 3 data sources:

Argilla to validate benchmark

Lists of ingredients corrected with GPT-3.5 are checked and modified to respect the spellcheck guidelines.

Evaluation algorithm

An evaluation algorithm is created to estimate the performance of the Spellcheck. It calculates the Precision-Recall of the correction based on text sequences (Original-Reference-Prediction) by using tokenization and alignment algorithm.