awslabs / data-on-eks

DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
https://awslabs.github.io/data-on-eks/
Apache License 2.0
639 stars 217 forks source link

[Feature] - Use AWS serverless workflow service step functions from EKS #105

Closed victorgu-github closed 1 year ago

victorgu-github commented 1 year ago

Community Note

What is the outcome that you are trying to reach?

[AWS Step Functions] which is a serverless workflow service can integrate with Amazon EMR on EKS and [Amazon EventBridge] to build event-driven workflows. After installing [AWS Controllers for Kubernetes (ACK) in EKS, you can provision and configure serverless AWS resources: Amazon EventBridge and AWS Step Functions from EKS. The team can do the whole data operation without leaving the Kubernetes platform and only need to maintain the EKS cluster since all the other components are serverless.

The airflow infra is too heavy for some users. An data pipeline with serverless services will offload a lot from admin works while ACK controllers allow them to stay in EKS to control and config those serverless services.

Describe the solution you would like

Describe alternatives you have considered

airflow, argo workflows

Additional context

victorgu-github commented 1 year ago

Based on the comments from Vara and Apoorva, replace ACK and just use terraform

vara-bonthu commented 1 year ago

Lets push this feature to EMR on EKS Workshop content

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has been open 30 days with no activity. Remove stale label or comment or this issue will be closed in 10 days

github-actions[bot] commented 1 year ago

Issue closed due to inactivity.