apache / incubator-graphar

An open source, standard data file format for graph data storage and retrieval.
https://graphar.apache.org/
Apache License 2.0
195 stars 40 forks source link

[Feat][Doc] Benchmarking C++/Spark readers with/without filter pushdown #403

Open lixueclaire opened 3 months ago

lixueclaire commented 3 months ago

Is your feature request related to a problem? Please describe. Filter pushdown is a sophisticated feature available with C++/Spark readers that has the potential to enhance query performance. It's important to evaluate its effectiveness.

Describe the solution you'd like To gauge the impact of filter pushdown, I propose using the LDBC dataset to benchmark the performance of reading operations. Specifically, we can measure how efficiently the C++/Spark readers can filter vertices or edges with certain property conditions when filter pushdown is enabled compared to when it is not.

Additional context This request is in continuation of the discussion in issue #389