LineaLabs / lineapy

Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
https://lineapy.org
Apache License 2.0
664 stars 58 forks

Argo Pipeline Integration #855

Closed lazargugleta closed 1 year ago

lazargugleta commented 1 year ago

Description

_module.py is generated in the same manner as for Airflow, so it remains unchanged.

The same goes for _requirements.txt.

The _dag file is an executable script that submits workflows to Argo directly using hera-workflows. It is generated from the argo_dag.jinja template. Since a Python function is the unit of execution, it stays quite close to the Airflow version. It defines a WorkflowService, either parameterized by the user or given default parameters, which is submitted along with the Workflow instance. Tasks and dependencies are generated in a different pattern, but they use the same methods that already provide the tasks and task definitions. The service account token is looked up and added to the WorkflowService as well. Running python3 {pipeline_name}_dag.py submits the workflow directly to the current Argo environment.

_Dockerfile is fine.
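
To make the "Python function as the unit of execution" idea concrete, here is a minimal local sketch of tasks wired together by dependencies. It deliberately does not use hera-workflows or Argo. The task functions, the TASKS mapping, and run_pipeline are all hypothetical names chosen for illustration, not what the generated _dag file contains.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical task functions, standing in for those in a generated _module.py.
def load_data():
    return [1, 2, 3]

def transform(values):
    return [v * 2 for v in values]

def summarize(values):
    return sum(values)

# Hypothetical layout: task name -> (function, names of upstream tasks it consumes).
TASKS = {
    "load_data": (load_data, []),
    "transform": (transform, ["load_data"]),
    "summarize": (summarize, ["transform"]),
}

def run_pipeline(tasks):
    """Run each task once, after all of its upstream tasks have finished."""
    graph = {name: set(deps) for name, (_, deps) in tasks.items()}
    results = {}
    for name in TopologicalSorter(graph).static_order():
        func, deps = tasks[name]
        # Pass upstream results positionally, in the declared order.
        results[name] = func(*(results[dep] for dep in deps))
    return results

results = run_pipeline(TASKS)
```

In an Argo setting each function would presumably run as its own workflow step rather than in a local loop, but the dependency ordering follows the same idea.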

Current issues

Type of change

How Has This Been Tested?

lazargugleta commented 1 year ago

@andycui97 Thank you! Nice work on your end as well! I admire your patience!