pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust
https://docs.pola.rs
Other
30.69k stars 1.99k forks source link

benchmark(python): Shift polars TPC-H benchmarks to main repo and include in CI #7740

Open chitralverma opened 1 year ago

chitralverma commented 1 year ago

Problem description

This is in reference to a discussion @ritchie46 and I were having on discord regarding the TPC-H benchmarks.

The idea is the following,

this will clearly show the rate of improvement of the project in terms of performance.

armgilles commented 1 year ago

Maybe an idea about monitoring performance for polars pytest-codspeed. It's used by pydantic-core and seems pretty easy and robust to use to monitor a bump or regression in terms of performance in CI (exemple https://github.com/pydantic/pydantic-core/pull/337)

chitralverma commented 1 year ago

datafusion did something similar i believe

https://github.com/apache/arrow-datafusion/pull/6131