ccao-data / data-architecture

Codebase for CCAO data infrastructure construction and management
https://ccao-data.github.io/data-architecture/
5 stars 3 forks source link

Spike using local Dagster for EL and orchestration #518

Open dfsnow opened 1 week ago

dfsnow commented 1 week ago

While doing some research into potential software for this budget season, I came across dagster. It's an orchestration platform similar to Airflow that uses a declarative Python model similar to dbt. It integrates with dbt natively and also shares dbt's open-core licensing. IMO, this could be a great potential fit for replacing our ad-hoc Python/R ETL scripts. If we ever get the time, I think it would be worthwhile to do a local spike of dagster using the CLI.