arahm071 / Melbourne-Modeling-Journey

Learn alongside me as I navigate the challenges of applying data science concepts to real-world data. This project highlights the importance of data preparation, modeling strategies, and the impact of data quality on analysis outcomes.
0 stars 0 forks source link

Regression Model Implementation for Melbourne Housing Dataset #5

Closed arahm071 closed 5 months ago

arahm071 commented 5 months ago

Introduction

This pull request details the development and refinement of a regression model for the Melbourne Housing dataset. It outlines the journey from an initial Ordinary Least Squares (OLS) model to a more robust LASSO model, addressing several key issues encountered along the way.

Background and Initial Model Creation

The branch began with the creation of an OLS regression model (3.1_ols_regression_model.py). The 3_regression_model.py file was initially empty, and the OLS model was the first attempt to analyze the dataset.

Approach and Methodology

Initial OLS Model Development:

Transition to LASSO Model:

ElasticNet Model Comparison:

Analysis and Findings

Key Improvements and Diagnostics:

Final Model Selection:

Conclusion and Future Directions

The LASSO model successfully addressed the issues found in the initial OLS model and proved to be the most suitable for our dataset. This process highlighted the importance of iterative modelling and diagnostics in regression analysis. Future considerations include the exploration of machine learning models to potentially enhance our analytical capabilities further.

Technical Details and Code Changes

Files Modified:

Files Added:

Testing and Validation