alejandra-rojas / TheUltimate-RecipeManager

A recipe collection manager for seasoned cooks with an ever-expanding collection of recipes. Full stack: React & .NET Web API

Add Recipe from URL #10

Open alejandra-rojas opened 1 month ago

alejandra-rojas commented 1 month ago

The user will be able to provide a URL for an article or page that contains a recipe, and the system should return either a recipe entry or a pre-filled recipe form for the user to correct and submit.

This will rely on AI to structure the data correctly and return it in JSON format.
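
Since the model's reply is free text until it is parsed, it probably makes sense to validate it before filling the form. A minimal sketch of that check is below; the `RecipeDraft` shape, its field names, and `parseRecipeDraft` are all assumptions for illustration, not the project's actual schema.

```typescript
// Hypothetical shape for a recipe draft. Field names are assumptions,
// not the schema used by the app.
interface RecipeDraft {
  title: string;
  ingredients: string[];
  steps: string[];
  sourceUrl?: string;
}

// Type guard: true only for arrays whose items are all strings.
function isStringArray(value: unknown): value is string[] {
  return Array.isArray(value) && value.every((item) => typeof item === "string");
}

// Parses the raw model reply. Returns a draft, or null when the reply
// is not valid JSON or is missing a required field.
function parseRecipeDraft(raw: string): RecipeDraft | null {
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch {
    return null; // the reply was not valid JSON
  }
  if (typeof data !== "object" || data === null) return null;

  const record = data as Record<string, unknown>;
  if (typeof record.title !== "string" || record.title.trim() === "") return null;
  if (!isStringArray(record.ingredients) || !isStringArray(record.steps)) return null;

  return {
    title: record.title.trim(),
    ingredients: record.ingredients,
    steps: record.steps,
    sourceUrl: typeof record.sourceUrl === "string" ? record.sourceUrl : undefined,
  };
}
```

If `parseRecipeDraft` returns `null`, the UI could fall back to an empty form so the user can still enter the recipe by hand.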

However, we need to provide OpenAI with the text of the HTML document, as it might not be possible for the AI to navigate to a URL.

I have built a basic scraper using cheerio (https://www.npmjs.com/package/cheerio) that returns the main text of the page. However, the code might only work for Guardian recipes, as other pages might have a different HTML structure.
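
For pages whose structure is unknown, one possible fallback is to skip site-specific selectors and reduce the whole document to plain text. The sketch below is not the cheerio scraper mentioned above; it is a dependency-free approximation using regular expressions, and `htmlToText`, `truncateForPrompt`, and the 12000-character limit are hypothetical names and values chosen for illustration.

```typescript
// Crude, dependency-free fallback: reduce an HTML string to readable text.
// Regex stripping is only an approximation; a real parser such as cheerio
// will handle malformed markup more reliably.
function htmlToText(html: string): string {
  return html
    // Drop elements whose content is never recipe text.
    .replace(/<(script|style|noscript)\b[^>]*>[\s\S]*?<\/\1\s*>/gi, " ")
    // Drop HTML comments.
    .replace(/<!--[\s\S]*?-->/g, " ")
    // Replace every remaining tag with a space so words do not run together.
    .replace(/<[^>]+>/g, " ")
    // Decode the two most common entities (others are left as-is).
    .replace(/&nbsp;/g, " ")
    .replace(/&amp;/g, "&")
    // Collapse runs of whitespace into single spaces.
    .replace(/\s+/g, " ")
    .trim();
}

// Caps the text before it is placed in a prompt.
// The default limit is an arbitrary assumption, not a documented model limit.
function truncateForPrompt(text: string, maxChars: number = 12000): string {
  return text.length > maxChars ? text.slice(0, maxChars) : text;
}

const sampleHtml =
  "<html><head><style>p{color:red}</style><script>var a = 1;</script></head>" +
  "<body><h1>Soup</h1><p>Boil&nbsp;water &amp; salt.</p></body></html>";

console.log(truncateForPrompt(htmlToText(sampleHtml)));
// prints "Soup Boil water & salt."
```

This keeps navigation menus, comments, and footers in the text, so the prompt would need to tell the model to ignore anything that is not part of the recipe.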

alejandra-rojas commented 1 month ago

We also need to consider that some pages might contain more than one recipe in a single HTML document.