horinezachary / goose-database

This repository holds the frontend code and supporting code for the Goose Database Recipe Repository.
https://goose.horine.dev
1 stars 1 forks source link

Parse websites for initial five #2

Closed cryarr closed 4 years ago

cryarr commented 4 years ago

This is primarily for exploration and needs to be done to see what data we get from the parsing. This is also a good idea for seeing how to format the data for transformation.

cryarr commented 4 years ago

Can be found on branch: issue-#2

horinezachary commented 4 years ago

I think we should define the structure of our scrapers, and the output that they give. We need to have a standardized output for each recipe object, which should probably be some sort of JSON object. By doing this, we can have many scraping functions for many different website that can all be imported into the database by the same main function.

richjoregonstate commented 4 years ago

{
    "author": String,
    "title": String,
    "url": String,
    "ingredients": [{"name": String, "amount": Float}],
    "time": {
        "prep": Int,
        "cook": Int,
        "ready": Int
    },
    "directions": [Strings]
}

Something like this?