gregbaker / spark-celery

Helper to allow Python Celery tasks to do work in a Spark job.
Apache License 2.0
27 stars 6 forks source link

How to broadcast files using Spark-Context from spark-celery #7

Closed NeelBhowmik closed 1 year ago

NeelBhowmik commented 6 years ago

Dear all, I am using spark-celery in my application. I am calling my query-System (where I need to access broadcasted big files) from spark-celery. I want to broadcast (using spark-context) all the files from the spark-celery only one time, and keep it up in the memory. So that, for the each new request to the application, I don't have to broadcast the files each time. How can I do it from spark-celery? Unfortunately I am not able to find any solutions. Could you please suggest some way forward in this situation?

Thank you very much.

Kind regards, Neel