road86 / bahis-data

Repository for cleaning and adjusting BAHIS-related data
0 stars 0 forks source link

Compare shortlisted ETL pipeline tools #5

Closed shaarful closed 1 year ago

shaarful commented 1 year ago
WaliZaman commented 1 year ago

All three tools had their shortcomings that were described in the comments before closing the individual issues. Luigi:Made models according to the tutorials and codes provided by the documentation and other sites where people have used online sql databases. One model which used a single python code was successful, but n could not be replicated. The others that had multiple python codes all ended with the task failing. My own code ran partially, only for saving a xlsx file in destination folders. When made into multiple python codes, this task also failed. Prefect: Prefect could not be run/tested due to lack of documentation of codes. The codes provided to test the models did not run and returned an error. DBT: Followed the documentation to setup everything required to run DBT on python. An online account is needed to access and create the environment required to run each project/model. A lot of setting up is required to run the pipelines.

So for now, I will try to set up a pipeline using python, from scratch.