Open ivanchai-1 opened 4 months ago
We have a problem converting uploads from Spark to Polars. This conversion is implemented in Pandas, but it takes a very long time to work on huge data. I would like to see this function in Polars for faster task calculations.
pandas_df = spark_df.toPandas() polars_df = pl.DataFrame(pandas_df) ↧↧↧↧ polars_df = pl.DataFrame(spark_df)
https://stackoverflow.com/questions/73203318/how-to-transform-spark-dataframe-to-polars-dataframe
@deanm0000 https://github.com/apache/spark/pull/45481, toArrow() will be available in spark 4.0 afaik
Description
We have a problem converting uploads from Spark to Polars. This conversion is implemented in Pandas, but it takes a very long time to work on huge data. I would like to see this function in Polars for faster task calculations.
pandas_df = spark_df.toPandas() polars_df = pl.DataFrame(pandas_df) ↧↧↧↧ polars_df = pl.DataFrame(spark_df)