WillKoehrsen / feature-selector

Feature selector is a tool for dimensionality reduction of machine learning datasets
GNU General Public License v3.0
2.23k stars 768 forks source link

Feature Selector: Simple Feature Selection in Python

Feature selector is a tool for dimensionality reduction of machine learning datasets.

Methods

There are five methods used to identify features to remove:

  1. Missing Values
  2. Single Unique Values
  3. Collinear Features
  4. Zero Importance Features
  5. Low Importance Features

Usage

Refer to the Feature Selector Usage notebook for how to use

Visualizations

The FeatureSelector also includes a number of visualization methods to inspect characteristics of a dataset.

Correlation Heatmap

Most Important Features

Requires:

python==3.6+
lightgbm==2.1.1
matplotlib==2.1.2
seaborn==0.8.1
numpy==1.22.0
pandas==0.23.1
scikit-learn==0.19.1

Contact

Any questions can be directed to wjk68@case.edu!