getindata / kedro-airflow-k8s

Kedro Plugin to support running pipelines on Kubernetes using Airflow.
https://kedro-airflow-k8s.readthedocs.io
Apache License 2.0
29 stars 11 forks source link

Pyspark support #112

Closed michalzelechowski-getindata closed 3 years ago

michalzelechowski-getindata commented 3 years ago

Supporting kedro projects which use pyspark. With this version, dataproc is the only supported spark infrastructure, but the overall design allows extensions, f.e. to AWS Glue. There's also the way to declare custom operators via configuration.


Keep in mind: