vanderschaarlab / autoprognosis

A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.
https://www.autoprognosis.vanderschaar-lab.com/
Apache License 2.0

Can Autoprognosis handle collinearity? #78

Closed (williamty closed 2 months ago)

williamty commented 1 year ago

Collinearity is a common problem in medical data analysis. For example, age group is a categorical variable and age is a numerical one, yet both carry the same information. If collinearity is not handled, a model can report very high prediction accuracy that is nonetheless meaningless. How does AutoPrognosis handle this?

DrShushen commented 2 months ago

AutoPrognosis handles multicollinearity in its data cleanup preprocessing plugin: https://github.com/vanderschaarlab/autoprognosis/blob/2b71de4cddb1fc63ed48c1df244e4afcdce802ce/src/autoprognosis/plugins/preprocessors/dimensionality_reduction/plugin_data_cleanup.py
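
For readers who want a feel for what collinearity filtering looks like in general, here is a minimal sketch using pandas and NumPy. It is not the plugin's code and may differ from what AutoPrognosis actually does. The function name `drop_collinear_features`, the 0.9 threshold, and the synthetic `age` / `age_group` / `biomarker` columns are all assumptions made for illustration, mirroring the age example from the question.

```python
import numpy as np
import pandas as pd


def drop_collinear_features(X: pd.DataFrame, threshold: float = 0.9) -> pd.DataFrame:
    """Drop one column from every pair whose absolute Pearson correlation exceeds `threshold`.

    Hypothetical helper for illustration only; not part of AutoPrognosis.
    """
    corr = X.corr().abs()
    # Keep only the strict upper triangle so each pair is inspected once
    # and a column is never compared with itself.
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    upper = corr.where(mask)
    # A column is dropped if it is highly correlated with any earlier column.
    to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
    return X.drop(columns=to_drop)


# Synthetic data (an assumption): age_group is derived from age,
# so the two columns are almost perfectly correlated.
rng = np.random.default_rng(0)
age = rng.integers(20, 90, size=200)
demo = pd.DataFrame(
    {
        "age": age,
        "age_group": age // 10,
        "biomarker": rng.normal(size=200),
    }
)

reduced = drop_collinear_features(demo, threshold=0.9)
print(list(reduced.columns))
```

In this sketch the later column of a correlated pair is the one removed, so column order decides which feature survives. Pairwise correlation also only catches two-variable redundancy; a feature that is a combination of several others would need a different check, such as a variance inflation factor.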