recodehive / machine-learning-repos

A curated list of awesome machine learning frameworks, libraries and software (by language). I
https://machine-learning-repos.vercel.app/
MIT License
103 stars 194 forks source link

💡[Feature]: Youtube toxic comment segment using Classification #576

Closed ashis2004 closed 1 month ago

ashis2004 commented 2 months ago

Is there an existing issue for this?

Feature Description

Implementation process- Data Preprocessing: Cleaning and preparing text data for analysis, including handling duplicates, converting text to lowercase, and removing special characters.

Exploratory Data Analysis: Understanding the distribution of toxic vs. non-toxic comments, visualizing word clouds, and exploring key metrics like precision, recall, and F1-score.

Model Development: Building a neural network model using TensorFlow/Keras, training it on the dataset with oversampling techniques like SMOTE, and evaluating its performance using classification metrics.

Evaluation: The model achieved a high accuracy of 96.38% on the test set, demonstrating its effectiveness in accurately identifying toxic comments.

Visualizations: Visualizations such as bar plots for toxic vs. non-toxic comments, model accuracy and loss plots, and a heatmap of classification report metrics enhance the understanding of the model's performance.

Use Case

  1. Protecting Users: Identifying and filtering out toxic comments protects users from harassment, bullying, hate speech, and other forms of online abuse.
  2. Maintaining Civility: It promotes civil discourse and constructive interactions by deterring toxic behavior and encouraging respectful communication.
  3. Enhancing User Experience: Creating a positive online experience fosters user engagement, trust, and loyalty, leading to a vibrant and inclusive community.
  4. Compliance: Many platforms have policies against hate speech and offensive content, making toxic comment detection crucial for regulatory compliance and platform integrity.

Benefits

No response

Add ScreenShots

No response

Priority

High

Record

github-actions[bot] commented 2 months ago

Thank you for creating this issue! 🎉 We'll look into it as soon as possible. In the meantime, please make sure to provide all the necessary details and context. If you have any questions reach out to LinkedIn. Your contributions are highly appreciated! 😊

Note: This repo is for beginners to learn and start with Opensource we won't accept more than 10 issues from a single person, This restriction applies to Gssoc project which has a similar kind of adding folder files, Points will be reduced when we find Spam.

I Maintain the repo issue twice a day, or ideally 1 day, If your issue goes stale for more than one day you can tag and comment on this same issue.

You can also check our CONTRIBUTING.md for guidelines on contributing to this project.

github-actions[bot] commented 1 month ago

Hello @ashis2004! Your issue #576 has been closed. Thank you for your contribution!