recodehive / Scrape-ML

For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.
https://scrape-ml.streamlit.app/
MIT License
107 stars 136 forks source link

Implementing Scheduled Scraping Feature #129

Closed ShaikArshidBanu closed 4 months ago

ShaikArshidBanu commented 5 months ago

Description: Add functionality to the IMDb Movie Review Scraper to run at scheduled intervals automatically. This feature will ensure that the dataset is updated regularly without manual intervention.

Feature Details:

Suggestion: Implement a scheduling feature using Python's schedule library or system cron jobs to automate the scraping process. Benefit: Keeps the dataset updated with the latest reviews regularly and reduces the need for manual execution.

Expected Outcome: The scraper will run automatically at the scheduled intervals, ensuring the dataset is consistently updated with new reviews.

Additional Context: This feature is crucial for users who require up-to-date review data for analysis and research without the need for manual intervention. The implementation will be flexible to allow users to specify the desired interval for scraping.

@sanjay-kv please assign me this issue under GSSOC'24.

github-actions[bot] commented 5 months ago

Thank you for raising a issue, Hope you enjoing the open source. we try to reply or assign as soon possibe. Connect with mentor.

github-actions[bot] commented 4 months ago

This issue has been automatically closed because it has been inactive for more than 30 days. If you believe this is still relevant, feel free to reopen it or create a new one. Thank you!