alteryx / featuretools

An open source python library for automated feature engineering
https://www.featuretools.com
BSD 3-Clause "New" or "Revised" License
7.19k stars 871 forks source link

Determine way to speed up calculate_feature_matrix #1967

Open gsheni opened 2 years ago

gsheni commented 2 years ago
Excidion commented 1 month ago

Maybe the issue can be helped by using something like narwhals to encourage users to "bring their own dataframe". This way performance would be mainly down to user choice.

MarcoGorelli commented 1 month ago

thanks @Excidion for your ping in the Narwhals repo, which brought me here!

I'm a fan of featuretools 🤩 (and made a little contribution a few years ago https://github.com/alteryx/featuretools/pull/727), it's exactly the kind of use case I had in mind for Narwhals. Polars really shines for feature engineering, see https://labs.quansight.org/blog/scikit-lego-narwhals for an example of the benefits that Narwhalifying scikit-lego brought

I'd be happy to help out here if you'd be interested, feel free to reach out (here or on Discord, link in the Narwhals readme) with any questions / requests! 🙏