csgeeek / scan-to-healthify

This webapp finds harmful ingredients in food products by extracting UPC from barcode. The ingredients dataset is stored in firestoredb
MIT License
3 stars 1 forks source link

Create scrapers to scrape ingredients of food items from online #5

Open csgeeek opened 1 year ago

csgeeek commented 1 year ago

We need more data (ingredients). The best source is the Internet. Build scrapers that could crawl websites and collect ingredients of respective food products.

You can use Scrapy or any other web scraping tool of your choice.

Tushar98644 commented 1 year ago

assign me...

Tushar98644 commented 1 year ago

@Yaswanth820 could you provide some resources on how to get started....

csgeeek commented 1 year ago

Hey @Tushar98644

This task is related to webscraping. I would prefer using Scrapy for this task. You can choose a grocery website (Bigbasket, Dunzo etc.) and scrape data of a few food items and their respective ingredients and store them in item-containers.

Please feel free to ask any further queries on discord channel or here itself. I will be available anywhere.