john-hu / untitled

0 stars 0 forks source link

yummly bug #59

Open john-hu opened 2 years ago

john-hu commented 2 years ago

https://www.yummly.co.uk/recipe/Mojito-Pork-Roast-2255241

The amount contains text:


    for r in iterable:
  File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/urllength.py", line 40, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable
    for r in iterable:
  File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/depth.py", line 58, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable
    for r in iterable:
  File "/home/pi/git/untitled/peeler/scrapy_utils/base_spiders.py", line 81, in parse
    raise ex
  File "/home/pi/git/untitled/peeler/scrapy_utils/base_spiders.py", line 68, in parse
    item = self.parse_response(response)
  File "/home/pi/git/untitled/peeler/yummly/spiders/recipe_result.py", line 89, in parse_response
    item.ingredients = self.parse_ingredient(response)
  File "/home/pi/git/untitled/peeler/yummly/spiders/recipe_result.py", line 35, in parse_ingredient
    ingredient_item['size'] = {'number': float(number_text)}
ValueError: could not convert string to float: 'Less than'```
john-hu commented 2 years ago

It may also contain divider

john-hu commented 2 years ago

Mark the record as wrong data. Let's try to pick it up later.