What
This PR separates the core implementation from PR#30, focusing on enabling disk caching for various sources supported by pg_analytics.
The benchmarking, specifically for Hive-style partitioned Parquet sources, leverages this core disk cache functionality and is implemented in paradedb/cargo-paradedb/src/pga_benches/pga_benchlogs_hsp_pq.rs.
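For readers unfamiliar with the layout being benchmarked: Hive-style partitioning encodes partition columns as `key=value` directory segments in each file's path. The sketch below only illustrates that naming convention. It is not code from this PR, and both the helper name `parse_hive_partitions` and the example path are hypothetical.

```rust
use std::collections::BTreeMap;

/// Hypothetical helper (not part of this PR): collects the `key=value`
/// directory segments of a Hive-style partitioned path into a map.
/// The final segment is treated as the file name and ignored.
fn parse_hive_partitions(path: &str) -> BTreeMap<String, String> {
    let mut segments: Vec<&str> = path.split('/').collect();
    // Drop the file name so a '=' inside it is never read as a partition.
    segments.pop();
    segments
        .into_iter()
        // Keep only segments of the form `key=value`.
        .filter_map(|seg| seg.split_once('='))
        // Skip malformed segments with an empty key, such as `=value`.
        .filter(|(key, _)| !key.is_empty())
        .map(|(key, value)| (key.to_string(), value.to_string()))
        .collect()
}
```

Under these assumptions, a path such as `s3://bucket/logs/year=2024/month=05/part-0.parquet` would yield the partition columns `year` and `month`, while a path with no `key=value` directories yields an empty map.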
Thank you for the review comments; I really appreciate them. Moving forward, I'll aim for smaller PRs with minimal changes to make them easier to review.
This PR is part of a pair; please review, validate, and consider merging both.
https://github.com/paradedb/paradedb/pull/1751 https://github.com/paradedb/pg_analytics/pull/148
Scope of this PR:
This PR supersedes the following previous submissions:
It is based on consolidated requirements and feedback from the review comments on the above closed PRs.
Why
How