feed0 / german-credit

Credit Risk Modeling with Python
MIT License

Credit Score Analysis

Anton Markov et al. (Credit scoring methods: Latest trends and points to consider, 2022) suggest that the University of California, Irvine (UCI) datasets are among the most popular public sources for credit score modeling. I have chosen the UCI Statlog (German Credit Data) dataset as a starting point.

German Credit Data

This dataset contains information about 1000 loan applications, including personal and financial data, credit history, and loan characteristics.

Objective

Train models to predict whether a loan is beneficial or not; in other words, predict its creditability for the financial institution.

Models

1. Logistic Regression

Figures: logistic regression confusion matrix (LogRegConfusionMatrix) and classification report (LogRegClassificationReport).
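To make the two figures easier to read, here is a minimal sketch of how the numbers in a confusion matrix turn into the precision, recall and F1 values of a classification report. It uses only the standard library. The helper names (`confusion_counts`, `precision_recall_f1`) and the toy labels are made up for illustration and are not the project's code or results.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive="Bad"):
    """Count TP, FP, FN, TN, treating `positive` as the class of interest."""
    pairs = Counter(zip(y_true, y_pred))
    tp = sum(n for (t, p), n in pairs.items() if t == positive and p == positive)
    fp = sum(n for (t, p), n in pairs.items() if t != positive and p == positive)
    fn = sum(n for (t, p), n in pairs.items() if t == positive and p != positive)
    tn = sum(n for (t, p), n in pairs.items() if t != positive and p != positive)
    return tp, fp, fn, tn


def precision_recall_f1(tp, fp, fn):
    """Per-class metrics; return 0.0 where a denominator would be zero."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Toy labels invented for illustration only; NOT the project's results.
y_true = ["Good", "Good", "Good", "Good", "Good", "Good", "Bad", "Bad", "Bad", "Bad"]
y_pred = ["Good", "Good", "Good", "Good", "Good", "Bad", "Good", "Good", "Good", "Bad"]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
accuracy = (tp + tn) / len(y_true)
print(f"accuracy={accuracy:.2f}")
print(f"Bad class: precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this toy case the overall accuracy looks acceptable while recall on the "Bad" class is low, which is the kind of pattern to look for in the report.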

Because some columns are imbalanced, the logistic regression model has difficulty predicting "Bad" loans. To overcome this limitation, we might consider oversampling the underrepresented categories in these columns.
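One possible way to try the oversampling idea is plain random oversampling: rows from the rarer categories are duplicated at random until every category is as frequent as the most common one. The sketch below assumes the data is held as a list of dicts. The function `oversample_by`, the column name `risk` and the toy rows are all hypothetical, and the same function could be pointed at any categorical column.

```python
import random
from collections import Counter, defaultdict


def oversample_by(rows, key, seed=0):
    """Resample rows with replacement until every value of `key` is as
    frequent as the most common one. All original rows are kept."""
    if not rows:
        return []
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    target = max(len(group) for group in groups.values())
    balanced = []
    for group in groups.values():
        balanced.extend(group)
        # Draw the shortfall at random from the same group.
        balanced.extend(rng.choices(group, k=target - len(group)))
    rng.shuffle(balanced)
    return balanced


# Hypothetical training rows: 7 "Good" and 3 "Bad" loans.
train_rows = (
    [{"id": i, "risk": "Good"} for i in range(7)]
    + [{"id": i, "risk": "Bad"} for i in range(7, 10)]
)

balanced = oversample_by(train_rows, key="risk")
counts = Counter(row["risk"] for row in balanced)
print(counts["Good"], counts["Bad"])
```

If this approach is used, it is usually safer to oversample only the training split, so that duplicated rows do not leak into the data used for evaluation.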