julianpoy / RecipeClipper

A JavaScript util for scraping recipes from (almost) any website
GNU Affero General Public License v3.0
73 stars 3 forks source link

Poorly pulling seriouseats recipe #71

Open mudnug opened 2 months ago

mudnug commented 2 months ago

Using Auto Import via RecipeSage on https://www.seriouseats.com/thick-and-fluffy-pancakes. Expected image Actual Includes 2 Serious Eats / Vicky Wasik 3 times and Pancakes 1 MOTHER'S DAY 2 FATHER'S DAY

julianpoy commented 2 months ago

Interesting - Just imported the URL and got this result

image

Are you using the in-site import, or something else?

mudnug commented 2 months ago

We're getting the same results. I was just highlighting in my summary some erroneous portions that show up in the instructions in your screenshot.

mudnug commented 2 months ago

I've imported this one to give you an example of a 'serious' issue 😀 https://recipesage.com/#/recipe/32d8d7ef-3f67-4e21-86f4-546c09050021

mudnug commented 1 month ago

It appears <figcaption> is repeated over and over in these recipes and should be ignored when this text repeats, e.g, at https://www.allrecipes.com/recipe/25787/coconut-macaroons-iii/