PhilippGerstung / etl

Python ETL process that fetches data from tankerkoenig and loads it into a duckdb table

Implement concept for regular database updates #3

Closed PhilippGerstung closed 1 month ago

PhilippGerstung commented 2 months ago

Concept:

Use a Raspberry Pi to run dagster. In dagster we fetch the data from tankerkoenig (we keep a local copy of the repository data on the Raspberry Pi and copy it into the container, so fetching does not take as long). After fetching, we update the duckdb incrementally and delete older entries.
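
A rough sketch of what the incremental update plus cleanup step could look like. Everything named here is an assumption for illustration and not taken from the repository: the `prices` table, its columns (`station_id`, `ts` as epoch seconds, `diesel`), the 30-day retention window, and the function name `incremental_update`. The function only relies on `execute`/`executemany` with `?` placeholders, which a `duckdb.connect(...)` connection provides; the demo uses the standard-library `sqlite3` as a stand-in purely so it runs without extra dependencies.

```python
import sqlite3  # demo stand-in only; a duckdb connection offers the same calls

RETENTION_SECONDS = 30 * 24 * 3600  # assumed retention window of 30 days


def incremental_update(con, rows, now):
    """Add unseen (station_id, ts, diesel) rows, then drop rows past retention.

    `con` is any connection with execute/executemany and `?` placeholders.
    `now` and `ts` are epoch seconds. Returns (rows_added, rows_deleted).
    """
    # Hypothetical schema; the primary key makes repeated loads idempotent.
    con.execute(
        "CREATE TABLE IF NOT EXISTS prices ("
        "station_id VARCHAR, ts BIGINT, diesel DOUBLE, "
        "PRIMARY KEY (station_id, ts))"
    )

    def count():
        return con.execute("SELECT count(*) FROM prices").fetchone()[0]

    before = count()
    if rows:  # skip the insert entirely when there is nothing to load
        con.executemany(
            "INSERT INTO prices VALUES (?, ?, ?) ON CONFLICT DO NOTHING", rows
        )
    after_insert = count()

    # Delete entries older than the retention window.
    con.execute("DELETE FROM prices WHERE ts < ?", [now - RETENTION_SECONDS])
    after_delete = count()
    return after_insert - before, after_insert - after_delete


def demo():
    con = sqlite3.connect(":memory:")
    day = 24 * 3600
    now = 100 * day
    # First run: two new rows, one of them already older than the cutoff.
    first = incremental_update(
        con, [("a", 10 * day, 1.50), ("a", 90 * day, 1.60)], now
    )
    # Second run: one row was loaded before and is skipped, one is new.
    second = incremental_update(
        con, [("a", 90 * day, 1.60), ("a", 95 * day, 1.70)], now
    )
    return first, second
```

In dagster this step would presumably run inside an asset or op after the fetch. Because rows that are already present are skipped, re-running the job on overlapping data should not create duplicates.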