recodehive / Scrape-ML

For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.
https://scrape-ml.streamlit.app/
MIT License
85 stars 115 forks source link

Adding Updated_Scrapper.py #162

Closed Harshitmishra001 closed 3 months ago

Harshitmishra001 commented 3 months ago

Related Issue

161

Description

-Custom Exception: Introducing a ScrapingError exception class allows you to handle errors specifically related to the scraping process.

-Explicit Timeouts: Using LOAD_MORE_TIMEOUT and SCRAPE_TIMEOUT makes the code more readable and easier to adjust the wait times for different operations.

-Error Handling in main: The main function now handles the ScrapingError exception, allowing the script to gracefully skip pages that encounter scraping issues.

-Constant Use: Using constants like MAX_PAGES improves readability and maintainability.

Type of PR

Screenshots / videos (if applicable)

[Attach any relevant screenshots or videos demonstrating the changes] Screenshot 2024-06-03 215503 Screenshot 2024-06-03 215515 Screenshot 2024-06-03 215523 Screenshot 2024-06-03 215532

Checklist: