AnotherSamWilson / miceforest

Multiple Imputation with LightGBM in Python
MIT License
353 stars, 31 forks

Improve performance of pandas-heavy areas #13

Closed: samFarrellDay closed this issue 3 years ago

samFarrellDay commented 3 years ago

complete_data() is a major slowdown, and there are probably others. miceRanger is much faster on small datasets because data.table is so much quicker than pandas. Can pandas be replaced with numpy in core areas?
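
To make the question concrete, here is a minimal sketch of the kind of swap being asked about. It is not miceforest's actual code: `fill_with_pandas`, `fill_with_numpy`, and the `imputed` dictionary layout are hypothetical names invented for illustration. Both functions write stored imputation values back into the missing cells; the numpy version works on a raw float array with integer positions, which skips pandas' label-based indexing. Whether that is faster in practice would need to be timed on real data.

```python
import numpy as np
import pandas as pd


def fill_with_pandas(df, imputed):
    """Hypothetical helper: fill missing cells using label-based .loc assignment."""
    out = df.copy()
    for col, values in imputed.items():
        mask = out[col].isna()
        out.loc[mask, col] = values
    return out


def fill_with_numpy(arr, columns, imputed):
    """Hypothetical helper: fill missing cells in a float array by integer position."""
    out = arr.copy()
    for col, values in imputed.items():
        j = columns.index(col)
        rows = np.flatnonzero(np.isnan(out[:, j]))
        out[rows, j] = values
    return out


# Small made-up dataset with a few missing cells.
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.normal(size=(6, 2)), columns=["a", "b"])
df.loc[[1, 4], "a"] = np.nan
df.loc[[2], "b"] = np.nan

# Assumed layout: one array of imputed values per column, in row order.
imputed = {"a": np.array([0.5, -0.5]), "b": np.array([1.0])}

via_pandas = fill_with_pandas(df, imputed)
via_numpy = fill_with_numpy(df.to_numpy(), list(df.columns), imputed)
```

The two results hold the same values, so the numpy path could be wrapped back into a DataFrame only at the end. Wrapping each version in `timeit` on a dataset of realistic size is the simplest way to check whether the change is worth making.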

AnotherSamWilson commented 3 years ago

Closed by dfa1790f0f91d0cc86f5e1de8cb19b2188d5ccfe