Open Nalito opened 1 day ago
Hello! Im interested in working on this issue and can contribute especially with Data Cleaning. Is there a deadline for this?
Hello! Im interested in working on this issue and can contribute especially with Data Cleaning. Is there a deadline for this?
Hello @yoenuts, Thank you for your interest! Contribution to the project starts on Tuesday 1st, October to kickoff the hacktoberfest event but you can start contributing now.
Please follow our contribution guidelines Create a folder using the project name and your github name under this folder in the repo. It would contain your notebook.
We also encourage you to register for our kickoff call to get firsthand information on what we expect from your contributions: MLSA Hacktoberfest
Weโre looking forward to your participation!๐
Hi @Nalito! Iโm eager to contribute to the cleaning and preprocessing of the crime dataset! ๐งน๐
Plan: Data Cleaning:
Remove Duplicates: Identify and eliminate any duplicate entries. Handle Missing Values: Determine appropriate strategies for dealing with missing data (e.g., imputation, removal). Convert Date Formats: Ensure all date fields are in a standardized format for consistency. Feature Engineering:
Crime Types: Categorize crimes into defined types for easier analysis. Regions: Create region-based metrics for spatial analysis. Time-Based Metrics: Generate features that capture temporal trends (e.g., crime rates over time). Next Steps: Iโll review the existing scripts and documentation to understand the current setup and ensure my contributions align with the project's structure. Iโll also make sure to test my code thoroughly before submitting a pull request. If there are any specific guidelines or additional details youโd like me to follow, please let me know. Iโm looking forward to collaborating with everyone! ๐
Clean and Preprocess Crime Dataset
Description: Perform data cleaning, including removing duplicates, handling missing values, and converting date formats. Preprocess the data to create features like crime types, regions, and time-based metrics. Labels: Data Cleaning, Data Preprocessing
What is Needed
Contributors are needed to perform data cleaning, including removing duplicates, handling missing values, and converting date formats.
How to Contribute
Getting Started
Before you begin, ensure you have read the Contribution Guidelines in the repository
We are excited to see your contributions! Happy Hacking! ๐