skillenza-com / MishMash-India-2020

MishMash hackathon is India’s largest online diversity hackathon. The focus will be to give you, regardless of your background, gender, sexual orientation, ethnicity, age, skill sets and viewpoints, an opportunity to showcase your talent. The Hackathon is Live from 6:00 PM, 23rd March to 11:55 PM, 1st April, 2020

SCH3M3_SH3LL - Social Scraper - Machine Intelligence / Social Impact #58

Closed Aravindha1234u closed 4 years ago

Aravindha1234u commented 4 years ago


Project information

  1. Theme: Machine Learning / Social Impact

  2. Project Name: Social Scraper

  3. Short Project Description: Social Scraper is a Python tool for detecting child predators and cyber harassers on social media.

  4. Team Name: SCH3M3_SH3LL

  5. Team Members:

    Aravindha Hariharan M
    Kabilan S
    Gowtham G
    Giridhara Prasath G

  6. Demo Link: Demo link

  7. Repository Link(s): Repository

  8. Presentation Link: Presentation

  9. Raw File:


🔥 Your Pitch

The proposed idea is to identify child predators and cyber harassers who operate on social media with malicious intent.

Suspect profiles that show child-grooming or cyber-harassment behavior patterns are usually identified manually, which drains time and resources. To resolve this, an automated system is proposed that identifies cyber predators and offenders using machine intelligence.

The system can analyze multiple social media platforms, such as Instagram, Twitter, Facebook, and LinkedIn, as well as other outlets, looking for the same suspect on each. If the suspect does not use the same user ID on different platforms, reverse image search is used to identify them.

A set of user IDs is used as the key to collect each user's personal information and post information (post ID, comments, timestamp, location, captions) from multiple social platforms using OSINT (Open Source INTelligence) and the BeautifulSoup Python package. The collected post data is analyzed for malevolent content using machine learning and the Pandas Python library. Based on the statistical analysis, suspects are categorized by their behavior (including polite harassment). Users whose suspect level is greater than the threshold value are scrutinized and monitored for further analysis.

The suspected user's post media (image, audio, and video) is then retrieved and analyzed using the IGPL Python package, Urllib, and artificial intelligence with an NSFW (Not Safe For Work) library, to decide whether the user falls under the suspect/predator category.
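The "user ID as a key" step can be illustrated with a minimal sketch. It is not taken from the Social Scraper repository: the function name `candidate_profile_urls` and the `PROFILE_URL_PATTERNS` table are hypothetical, and the URL patterns are only the commonly seen public profile paths, which may change. The sketch builds candidate URLs and makes no network requests.

```python
from urllib.parse import quote

# Hypothetical table: platform name -> public profile URL pattern.
# These patterns are assumptions for illustration, not the tool's own config.
PROFILE_URL_PATTERNS = {
    "instagram": "https://www.instagram.com/{user}/",
    "twitter": "https://twitter.com/{user}",
    "facebook": "https://www.facebook.com/{user}",
    "linkedin": "https://www.linkedin.com/in/{user}",
}


def candidate_profile_urls(user_id, patterns=PROFILE_URL_PATTERNS):
    """Return one candidate profile URL per platform for the same user ID."""
    # Percent-encode the ID so unusual characters cannot alter the URL path.
    safe_id = quote(user_id.strip(), safe="")
    return {
        platform: pattern.format(user=safe_id)
        for platform, pattern in patterns.items()
    }


if __name__ == "__main__":
    for platform, url in candidate_profile_urls("some_user").items():
        print(platform, url)
```

Each candidate URL would then be fetched and parsed (for example with BeautifulSoup, as the pitch mentions) to confirm that the profile exists before any post data is collected.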

Finally, the child-grooming patterns, followers, and generated statistical results are analyzed together. If the person concerned is classified as a predator, they are reported to law enforcement authorities.
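The thresholding step in the pitch (users whose suspect level exceeds a threshold are kept for further scrutiny) could be sketched as follows. This is an assumption-laden illustration, not the project's code: the per-post scores are presumed to come from an upstream classifier, the mean is used as the aggregate only for simplicity, and `SUSPECT_THRESHOLD`, `suspect_levels`, and `flag_for_review` are hypothetical names. Plain Python is used here; a Pandas `groupby` would express the same aggregation.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical cut-off; the pitch does not state the actual threshold value.
SUSPECT_THRESHOLD = 0.6


def suspect_levels(post_scores):
    """Average per-post scores (0.0 to 1.0) into one suspect level per user.

    post_scores is an iterable of (user_id, score) pairs, where each score
    is assumed to be produced by an upstream content classifier.
    """
    scores_by_user = defaultdict(list)
    for user_id, score in post_scores:
        scores_by_user[user_id].append(score)
    return {user_id: mean(scores) for user_id, scores in scores_by_user.items()}


def flag_for_review(levels, threshold=SUSPECT_THRESHOLD):
    """Return the user IDs whose suspect level is strictly above the threshold."""
    return sorted(user_id for user_id, level in levels.items() if level > threshold)


if __name__ == "__main__":
    scores = [("user_a", 0.9), ("user_a", 0.7), ("user_b", 0.1), ("user_b", 0.3)]
    print(flag_for_review(suspect_levels(scores)))
```

Flagged users are candidates for human review and further monitoring rather than a final classification, which matches the pitch's description of scrutiny before reporting.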

🔦 Any other specific thing you want to highlight?

✅ Checklist

Before you post the issue: