vaexio / vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
https://vaex.io
MIT License
8.28k stars 590 forks source link

[BUG-REPORT] - Issue with using vaex export_csv in a celery / airflow process #2075

Closed balihoo-gsuri closed 2 years ago

balihoo-gsuri commented 2 years ago

@JovanVeljanoski - Facing an Issue with using vaex export_csv in a celery / airflow process.

Description When trying to export a vaex dataframe in a celery worker or an Airflow DAG, an issue occurs ""daemonic processes are not allowed to have children"

Software information

Any help is appreciated.

balihoo-gsuri commented 2 years ago

Update: The issue was in the use of the apply function of vaex df. Got it working by passing multiprocessing=False to the apply function