john-hu / untitled

0 stars 0 forks source link

improve general parse #82

Closed john-hu closed 2 years ago

john-hu commented 2 years ago

2021-12-08 20:27:54 [scrapy.core.scraper] ERROR: Spider error processing <GET https://www.101cookbooks.com/archives/an-exceptional-salad-with-an-unusual-coconut-oil-dressing-recipe.html> (referer: None) Traceback (most recent call last): File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/utils/defer.py", line 120, in iter_errback yield next(it) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/utils/python.py", line 353, in next return next(self.data) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/utils/python.py", line 353, in next return next(self.data) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable for r in iterable: File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/offsite.py", line 29, in process_spider_output for x in result: File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable for r in iterable: File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/referer.py", line 342, in return (_set_referer(r) for r in result or ()) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable for r in iterable: File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/urllength.py", line 40, in return (r for r in result or () if _filter(r)) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable for r in iterable: File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/spidermiddlewares/depth.py", line 58, in return (r for r in result or () if _filter(r)) File "/home/pi/git/untitled/env/lib/python3.7/site-packages/scrapy/core/spidermw.py", line 56, in _evaluate_iterable for r in iterable: File "/home/pi/git/untitled/peeler/scrapy_utils/spiders/generator_base.py", line 65, in parse raise ex File "/home/pi/git/untitled/peeler/scrapy_utils/spiders/generator_base.py", line 35, in parse for item in self.yield_results(response): File "/home/pi/git/untitled/peeler/general/spiders/general_result.py", line 74, in yield_results recipe_language = self.parse_html_language(response) File "/home/pi/git/untitled/peeler/general/spiders/general_result.py", line 44, in parse_recipe item = self.parse_raw_recipe(recipe, recipe_language, site_name, response.request.url) File "/home/pi/git/untitled/peeler/general/spiders/general_result.py", line 21, in parse_raw_recipe item = RecipeItem.from_schema_org(recipe) File "/home/pi/git/untitled/peeler/scrapy_utils/items.py", line 81, in from_schema_org item.suitableForDiet = parse_suitable_for_diet(recipe.get('suitableForDiet', None)) File "/home/pi/git/untitled/peeler/utils/schema_org.py", line 102, in parse_suitable_for_diet return [DIET_MAP[data]] if data in DIET_MAP else None TypeError: unhashable type: 'list'