Closed EthanSteinberg closed 3 months ago
DuckDB (https://duckdb.org) is a powerful data processing library.
It turns out that it works extremely well for our MEDS Flat -> MEDS ETL.
This PR implements a DuckDB backend for that ETL, with associated unit tests. It is much faster than the old ETL.