Closed phgrosjean closed 1 year ago
Hi @phgrosjean,
You're right, I'll correct that and post the new benchmark results in this issue.
Thanks for the feedback!
I've made the corrections and, while we wait for the book to become available again, here are the results on my computer:
Hello,
In your CSV benchmark, you use `read.csv()` (the base function, which is very slow) for all three versions (base, dplyr and data.table), while comparing against the Polars-specific CSV reader. Most of the time is spent in this function, so all three get nearly the same timing, which is not representative of each implementation: tidyverse would use `readr::read_csv()` and data.table would use `fread()` instead.
Also, in your comparison between eager and lazy Polars, you forgot to `collect()` the lazy version. It should be:
Otherwise, excellent work!
PhG