sfu-db / dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
http://dataprep.ai
MIT License
1.99k stars, 203 forks

add the use-case of dataprep.eda for spark dataframe with ray #858

Closed. Bowen0729 closed this 2 years ago

Bowen0729 commented 2 years ago

Description

Add a use case showing how to run dataprep.eda on a Spark DataFrame.

This use case lets us support big-data sources. Calling toPandas directly is not an option, because the data is too large to collect into a single pandas DataFrame.

https://github.com/sfu-db/dataprep/issues/852#issuecomment-1071976007
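
For illustration, one way to avoid `toPandas` is sketched below. This is an assumption about how such a use case could look, not the code from the linked comment. It assumes the Spark session was started on Ray (for example through RayDP's `raydp.init_spark`), that `ray.data.from_spark` and `Dataset.to_dask` are available in the installed Ray version, and that `dataprep.eda.plot` is given the resulting Dask DataFrame. The helper names `spark_to_dask` and `eda_on_spark` are hypothetical.

```python
def spark_to_dask(spark_df):
    """Convert a Spark DataFrame to a Dask DataFrame without toPandas.

    Assumption: the Spark session behind `spark_df` runs on Ray
    (e.g. created with raydp.init_spark), which ray.data.from_spark needs.
    """
    # Heavy dependencies are imported lazily so the module loads without them.
    import ray
    from ray.util.dask import enable_dask_on_ray

    # Keep the data distributed as Ray Dataset blocks instead of
    # collecting it into one pandas DataFrame on the driver.
    dataset = ray.data.from_spark(spark_df)

    # Let Dask computations be scheduled on the Ray cluster.
    enable_dask_on_ray()

    # Expose the Ray Dataset as a lazy Dask DataFrame.
    return dataset.to_dask()


def eda_on_spark(spark_df):
    """Run dataprep.eda.plot on a Spark DataFrame via a Dask DataFrame."""
    from dataprep.eda import plot

    dask_df = spark_to_dask(spark_df)
    # Returns dataprep's report object for the overview plots.
    return plot(dask_df)
```

A caller would pass an existing Spark DataFrame, for example `report = eda_on_spark(spark_df)`, and then display or save the returned report as with any other dataprep.eda result.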