hhursev / recipe-scrapers

Python package for scraping recipes data
MIT License
1.74k stars 531 forks

No Directions and Instructions returned in some recipes from lekkerensimpel.com/ #761

Open JMenzingaST opened 1 year ago

JMenzingaST commented 1 year ago

Pre-filing checks

The URL of the recipe(s) that are not being scraped correctly

The results you expect to see

A list with ingredients and a list with directions ...

The results (including any Python error messages) that you are seeing

Empty lists. No error codes. ...

jayaddison commented 1 year ago

Ok, hmm. It seems that although lekkerensimpel does provide some schema.org metadata on the page (which we collect using calls to self.schema.<field_name_here>()), it doesn't provide Recipe-related schema metadata, at least not for these recipe URLs.
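One way to confirm this for a given page is to inspect its structured data directly. The sketch below assumes the metadata is embedded as JSON-LD in `<script type="application/ld+json">` tags; schema.org data can also be expressed as microdata or RDFa, which this does not cover. The helper name `jsonld_types` and the sample HTML are made up for illustration and are not part of recipe-scrapers or taken from lekkerensimpel.com.

```python
import json

from bs4 import BeautifulSoup


def jsonld_types(html):
    """Return the set of schema.org @type values found in JSON-LD blocks.

    Hypothetical helper for illustration; not part of recipe-scrapers.
    """
    soup = BeautifulSoup(html, "html.parser")
    types = set()
    for tag in soup.find_all("script", type="application/ld+json"):
        try:
            data = json.loads(tag.string or "")
        except json.JSONDecodeError:
            continue  # skip empty or malformed blocks
        # A block may hold a single object or a list of objects.
        items = data if isinstance(data, list) else [data]
        for item in items:
            if not isinstance(item, dict):
                continue
            # Some sites nest their entities under "@graph".
            for node in item.get("@graph", [item]):
                if not isinstance(node, dict):
                    continue
                node_type = node.get("@type", [])
                if isinstance(node_type, str):
                    node_type = [node_type]
                types.update(node_type)
    return types


# Invented sample page: it has schema.org metadata, but no Recipe entity.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@graph": [
  {"@type": "WebSite", "name": "Example"},
  {"@type": "Article", "headline": "Example recipe post"}
]}
</script>
</head><body></body></html>
"""

types = jsonld_types(SAMPLE_HTML)
has_recipe = "Recipe" in types
```

If `has_recipe` is false for a page, the schema-based lookups have nothing Recipe-related to read from, which would be consistent with the empty results reported here.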

Probably the way to resolve this is to use self.soup (the BeautifulSoup HTML parser that is provided with the data for each page when the scraper runs) for HTML-based extraction of the ingredients and instructions from the page.
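A rough sketch of what that HTML-based extraction could look like with BeautifulSoup follows. The class names (`recipe-ingredients`, `recipe-instructions`), the function names, and the sample markup are all assumptions made for illustration; the real lekkerensimpel.com markup would need to be inspected and the selectors adjusted. Inside a scraper, the `soup` argument would be `self.soup` rather than a freshly parsed string.

```python
from bs4 import BeautifulSoup

# Invented sample markup; the real site's structure and class names may differ.
SAMPLE_HTML = """
<div class="recipe">
  <ul class="recipe-ingredients">
    <li>4 slices of bread</li>
    <li> 2 eggs </li>
  </ul>
  <div class="recipe-instructions">
    <p>Whisk the eggs.</p>
    <p>Fry the bread until golden.</p>
  </div>
</div>
"""


def extract_ingredients(soup):
    """Return one stripped string per <li> in the (assumed) ingredients list."""
    items = soup.select("ul.recipe-ingredients li")
    return [li.get_text(strip=True) for li in items]


def extract_instructions(soup):
    """Return the steps joined by newlines, or "" if the page has none."""
    container = soup.find("div", class_="recipe-instructions")
    if container is None:
        return ""  # some pages may not include instructions at all
    steps = [p.get_text(strip=True) for p in container.find_all("p")]
    return "\n".join(step for step in steps if step)


soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
ingredients = extract_ingredients(soup)
instructions = extract_instructions(soup)
```

Returning an empty string when the instructions container is missing keeps the extraction from raising on pages that only list ingredients.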

Note: not all of the pages seem to include instructions! The Appel Kaneel Wentelteefjes recipe does for sure, though.