Open surajwate opened 1 month ago
I have decided to develop a full package instead of a module.
Planned modules:
missing.py
# handling missingstandardization.py
# standardizing dataoutliers.py
# detecting and handling outliersconversion.py
# data type conversionencoding.py
# encoding and scalingvalidation.py
# data validationadvanced.py
# advanced cleaning techniquesutils.py
# utility functions
Data Cleaning Functions (
cleaning.py
)remove_missing(data)
: Function to handle missing data based on defined criteria.standardize_columns(data)
: Standardize column names to a consistent format.detect_outliers(data, method='IQR')
: Identify outliers using the interquartile range or other specified methods.