prrao87 / duckdb-study

Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
MIT License
26 stars 1 forks source link

Add pandas #3

Closed prrao87 closed 1 year ago

prrao87 commented 1 year ago

Goal

To showcase the power of embedded DBs, it makes sense to attempt the entire process with pandas as well as polars. There were numerous issues with reproducibility between polars and duckdb, but this was reconciled in this PR by adding pandas, and carefully debugging each intermediate DataFrame/table in the process.

Accomplishments

To do