datahq / dataflows

DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.
https://dataflows.org
MIT License
192 stars 39 forks source link

Comparison with Meltano, Mara, Airflow and other ETL tools #73

Open rufuspollock opened 5 years ago

rufuspollock commented 5 years ago

As a potential User of dataflows I want to understand how it compares to other tools so that I get what use cases it was designed for and why (or why not) i should use it (and also deepen respect for its creators because i know they know their stuff).

As an example of this done very well see VuePress https://vuepress.vuejs.org/guide/#why-not (short) and VueJS https://vuejs.org/v2/guide/comparison.html (long)

Tasks

augusto-herrmann commented 4 years ago

As far as I understand, Apache Airflow is a tool for orchestration and scheduling tool that could be used together with dataflows.

You define what to do to the data in dataflows. You define when to do it, manage job dependencies, what to do in case of failure, etc. in Airflow.