For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.
-Custom Exception: Introducing a ScrapingError exception class allows you to handle errors specifically related to the scraping process.
-Explicit Timeouts: Using LOAD_MORE_TIMEOUT and SCRAPE_TIMEOUT makes the code more readable and easier to adjust the wait times for different operations.
-Error Handling in main: The main function now handles the ScrapingError exception, allowing the script to gracefully skip pages that encounter scraping issues.
-Constant Use: Using constants like MAX_PAGES improves readability and maintainability.
Type of PR
[ ] Bug fix
[ ] Feature enhancement
[✅ ] Documentation update
[ ] Other (specify): ___
Screenshots / videos (if applicable)
[Attach any relevant screenshots or videos demonstrating the changes]
Checklist:
[ ✅] I have performed a self-review of my code
[✅ ] I have read and followed the Contribution Guidelines.
[ ✅] I have tested the changes thoroughly before submitting this pull request.
[✅ ] I have provided relevant issue numbers, screenshots, and videos after making the changes.
[ ✅] I have commented my code, particularly in hard-to-understand areas.
Related Issue
161
Description
-Custom Exception: Introducing a ScrapingError exception class allows you to handle errors specifically related to the scraping process.
-Explicit Timeouts: Using LOAD_MORE_TIMEOUT and SCRAPE_TIMEOUT makes the code more readable and easier to adjust the wait times for different operations.
-Error Handling in main: The main function now handles the ScrapingError exception, allowing the script to gracefully skip pages that encounter scraping issues.
-Constant Use: Using constants like MAX_PAGES improves readability and maintainability.
Type of PR
Screenshots / videos (if applicable)
[Attach any relevant screenshots or videos demonstrating the changes]
Checklist: