tidyverse / duckplyr

A drop-in replacement for dplyr, powered by DuckDB for performance.
https://duckplyr.tidyverse.org/
Other
253 stars 15 forks source link

Illustrative benchmarks #215

Open hadley opened 1 month ago

hadley commented 1 month ago

I think it would be useful to include a couple of illustrative benchmarks that show the benefits of duckdplyr. Maybe one where there's some optimisation that duckdb makes that substantially improves performance on an in memory dataset, and one where you're working with a dataset that wouldn't generally fit in memory so you couldn't even use dplyr.