q-m / food-ingredient-parser-ruby

Extract the structure of ingredient lists on food products
MIT License

Mark in front confuses loose parser #7

Closed: wvengen closed this issue 6 years ago

wvengen commented 6 years ago

Given the ingredient list "*Kappertjes (58%), *wijnazijn (21%), water, zout, *uit de biologische landbouw." (Dutch for: *capers, *wine vinegar, water, salt, *from organic farming), the loose parser returns the entire list as the first ingredient instead of splitting it.

wvengen commented 6 years ago

The loose parser also returns the entire list as the first ingredient in other cases, for example with the input "(1%) foo, bar".
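
The symptom in both reports is the same: the first parsed ingredient is the whole input string. A minimal sketch of a check for that symptom follows. The helper `swallowed_input?` is hypothetical and not part of food_ingredient_parser, and it assumes the parse result is a hash shaped like `{ contains: [{ name: "..." }, ...] }`. The two result hashes are hand-written for illustration, not actual parser output.

```ruby
# Hypothetical helper, not part of food_ingredient_parser.
# Assumption: the parse result looks like { contains: [{ name: "..." }, ...] }.
def swallowed_input?(input, parsed)
  first = parsed.fetch(:contains, []).first
  return false if first.nil?

  # Compare ignoring whitespace differences and a trailing period.
  normalize = ->(s) { s.to_s.gsub(/\s+/, " ").strip.sub(/\.\z/, "") }
  normalize.call(first[:name]) == normalize.call(input)
end

input = "*Kappertjes (58%), *wijnazijn (21%), water, zout, *uit de biologische landbouw."

# Hand-written results for illustration only; not actual parser output.
buggy = { contains: [{ name: input }] }
split = { contains: [{ name: "Kappertjes" }, { name: "wijnazijn" },
                     { name: "water" }, { name: "zout" }] }

puts swallowed_input?(input, buggy) # true: first ingredient is the whole list
puts swallowed_input?(input, split) # false: the list was split into ingredients
```

A check like this could serve as a regression test for the inputs in this issue, once it is fed real parser output.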

wvengen commented 6 years ago

Fixed. Note that marks in front of ingredients are generally recognized as the start of the notes.