recodehive / Scrape-ML

For new data generation in the Semi-supervised-sequence-learning-Project, we have written a Python script to fetch 📊 data from the 💻 IMDb website 🌐 and convert it into txt files.
https://scrape-ml.streamlit.app/
MIT License
105 stars 137 forks

[Feature]: Identify sensitive content based on user comments #150

Closed Keerthana-T-P closed 4 months ago

Keerthana-T-P commented 5 months ago

I would like to propose a new feature for our web scraping tool: identifying sensitive content based on user comments scraped from IMDb. This would let users filter and classify movies by potentially sensitive content, such as 18+ material.

Proposed Solution:

The feature will involve:
1. Collecting user comments from IMDb for a variety of movies.
2. Training a machine learning model with NLP libraries (I will be using nltk) to classify movies as containing sensitive content or not, based on the content of the comments.
3. Integrating the trained model into the web scraping tool to classify new comments in real time.

Please assign this issue to me under the label GSSoC'24 @sanjay-kv
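As a rough illustration of the train-then-classify steps in the proposal above, here is a minimal sketch. It is not the project's implementation. The issue names nltk; to keep the example free of external dependencies, a small hand-written multinomial naive Bayes classifier stands in for whatever nltk model is eventually chosen. The training comments, the labels `sensitive` and `safe`, the helper names, and the 0.5 threshold are all invented for illustration and are not taken from the repository.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the comment and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    """Tiny multinomial naive Bayes with add-one smoothing (stand-in for an nltk model)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of comments
        self.vocab = set()

    def train(self, labeled_comments):
        for text, label in labeled_comments:
            self.doc_counts[label] += 1
            for word in tokenize(text):
                self.word_counts[label][word] += 1
                self.vocab.add(word)

    def classify(self, text):
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label, n_docs in self.doc_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            total_words = sum(self.word_counts[label].values())
            for word in tokenize(text):
                if word not in self.vocab:
                    continue  # ignore words never seen in training
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            scores[label] = score
        return max(scores, key=scores.get)


def movie_is_sensitive(model, comments, threshold=0.5):
    """Flag a movie when at least `threshold` of its comments look sensitive."""
    if not comments:
        return False
    flagged = sum(model.classify(c) == "sensitive" for c in comments)
    return flagged / len(comments) >= threshold


# Hypothetical hand-labeled comments; real data would come from the scraper.
TRAIN = [
    ("graphic violence and gore throughout", "sensitive"),
    ("explicit nudity, not for kids", "sensitive"),
    ("very bloody and disturbing scenes", "sensitive"),
    ("a sweet family film for all ages", "safe"),
    ("my kids loved the funny animation", "safe"),
    ("heartwarming story with great music", "safe"),
]

model = NaiveBayes()
model.train(TRAIN)

if __name__ == "__main__":
    print(model.classify("too much gore and nudity"))
    print(movie_is_sensitive(model, ["bloody and disturbing", "great music"]))
```

Classifying individual comments first and then aggregating per movie keeps the model simple; a handful of toy examples like these is far too small for real use, so the labeled set would need to be much larger and reviewed by hand.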

github-actions[bot] commented 5 months ago

Thank you for raising an issue. We hope you are enjoying open source. We try to reply or assign as soon as possible. Connect with a mentor.

Keerthana-T-P commented 5 months ago

Thank you for assigning me this issue.

github-actions[bot] commented 4 months ago

This issue has been automatically closed because it has been inactive for more than 30 days. If you believe this is still relevant, feel free to reopen it or create a new one. Thank you!