Open DeepBlockDeepak opened 8 months ago
During model training, encountered a ValueError indicating a type mismatch, which prevented successful model fitting:
ValueError: could not convert string to float: 'Boulos, Mrs. Joseph (Sultana)'
This error suggested a fundamental issue in preprocessing pipeline where numeric data was being inadvertently cast to object types.
Every column, including those intended to be numeric (num__*)
, was cast to object
dtype, posing a serious problem for feeding the data into the models that expect numerical input.
The objective is to refactor current preprocessing and feature engineering workflow to integrate custom feature engineering steps into a scikit-learn pipeline, making the codebase more modular, maintainable, and efficient.
Goals
Tasks