Recode-Hive / Scrape-ML

To generate new data for the Semi-supervised-sequence-learning-Project, we have written a Python script to fetch 📊 data from the 💻 IMDb website 🌐 and convert it into txt files.
https://scrape-ml.streamlit.app/
MIT License
80 stars 117 forks

Update scrapping.py #205

Closed: bhanushri12 closed this 2 days ago

bhanushri12 commented 3 days ago

Enhanced how product listings are written to the CSV file.

#167

Description

Enhanced the script (pull request #167) to scrape product listings and prices from Amazon for specified categories (Laptop, Phones, Printers, etc.) and save them into a structured CSV file (prod_listings.csv). Each entry includes the category, product name, and price.
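The CSV layout described above could be produced roughly as follows. This is a minimal sketch, not the code from scrapping.py: the function name `write_listings` is hypothetical, and only the file name prod_listings.csv and the three columns come from the description.

```python
import csv
from pathlib import Path

# Column order taken from the PR description.
FIELDNAMES = ["Category", "Product Name", "Price"]


def write_listings(rows, path="prod_listings.csv"):
    """Write (category, product name, price) tuples to a CSV file.

    `write_listings` is a hypothetical helper used for illustration only.
    Returns the number of data rows written.
    """
    rows = list(rows)
    # newline="" lets the csv module control line endings on every platform.
    with Path(path).open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(FIELDNAMES)  # header row
        writer.writerows(rows)       # fields containing commas are quoted automatically
    return len(rows)
```

Using the `csv` module rather than joining strings by hand matters here because product names often contain commas, which would otherwise shift the price into the wrong column.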

Additional context:

The script now uses Selenium to scrape Amazon for multiple product categories, captures both product names and prices, and organizes the data into a CSV file format for easy analysis.

Explanation:

•   Enhanced Script: Modified the existing script (from pull request #167) to iterate over a list of predefined product categories, scrape product names and prices using Selenium, and save the data into a structured CSV file.
•   Data Organization: Each row in the CSV file contains three columns: Category, Product Name, and Price, making it easy to analyze and compare product listings across different categories.
•   WebDriver Usage: Utilized WebDriver (ChromeDriver) to automate browsing tasks, handle dynamic content loading, and ensure accurate data extraction from Amazon’s search results.
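The category loop and WebDriver usage listed above might look something like the sketch below. It is written under several assumptions that the PR text does not confirm: the search URL pattern, the three CSS selectors, and all function names are illustrative guesses, and Amazon's markup changes often enough that the selectors would need checking against the live page.

```python
from urllib.parse import urlencode

# Categories named in the PR description; the full list in the script may differ.
CATEGORIES = ["Laptop", "Phones", "Printers"]

# Assumed selectors, not taken from the PR.
RESULT_SELECTOR = "div[data-component-type='s-search-result']"
NAME_SELECTOR = "h2"
PRICE_SELECTOR = "span.a-price-whole"


def build_search_url(category):
    """Build a search URL for one category (assumed URL pattern)."""
    return "https://www.amazon.com/s?" + urlencode({"k": category})


def scrape_category(driver, category, timeout=10):
    """Return (category, name, price) tuples for one search results page."""
    # Selenium is imported lazily so the helpers above work without it installed.
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    driver.get(build_search_url(category))
    # Explicit wait: block until the dynamically loaded result cards exist.
    cards = WebDriverWait(driver, timeout).until(
        EC.presence_of_all_elements_located((By.CSS_SELECTOR, RESULT_SELECTOR))
    )
    rows = []
    for card in cards:
        names = card.find_elements(By.CSS_SELECTOR, NAME_SELECTOR)
        prices = card.find_elements(By.CSS_SELECTOR, PRICE_SELECTOR)
        if names and prices:  # skip cards with no visible name or price
            rows.append((category, names[0].text.strip(), prices[0].text.strip()))
    return rows


def scrape_all(categories=CATEGORIES):
    """Scrape every category with one shared Chrome session."""
    from selenium import webdriver

    driver = webdriver.Chrome()
    try:
        rows = []
        for category in categories:
            rows.extend(scrape_category(driver, category))
        return rows
    finally:
        driver.quit()  # always release the browser, even on errors
```

Reusing one driver across categories avoids launching a browser per search, and the explicit wait is one way to handle the dynamic content loading mentioned above without fixed `sleep` calls.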