-
**What**
Alongside ClickBench, the most popular OLAP benchmarking framework is probably TPC-H. It's a much older benchmark, and is trusted by the older enterprises.
DataFusion already has native s…
-
This is a collection of tickets related to making DataFusion's planning speed faster. Planning speed is the time from a SQL string being created to when the `ExecutionPlan` is created
- [x] https:/…
-
This has a list of performance improvements:
- [x] https://github.com/apache/arrow-datafusion/issues/5230
- [x] https://github.com/apache/arrow-datafusion/issues/4973
- [ ] https://github.com/apa…
-
### Describe the bug
I just cloned datafusion and tried `cargo t` on my ubuntu desktop, to check things were working properly.
It crashed.
I restarted, and it seems datafusion is using 50.1GB…
-
I recently wrote a blog [post](https://maxmeldrum.com/docs/posts/2024-05-14-uwheel-datafusion.html) about speeding up temporal aggregation queries significantly in DataFusion by using [µWheel](https:/…
-
This is a follow on to https://github.com/apache/arrow-datafusion/issues/3058 as we have made significant progress since @kmitchener originally posted that
“Write cool software and _tell people a…
-
ORC timestamp type is not straightforward, as though it apparently represents a timestamp without a timezone, its encoding & decoding is still dependent on the writer timezone (encoded in the stripe) …
-
### Is your feature request related to a problem or challenge?
We have had good luck writing up quarterly updates for DataFusion, most recently:
https://arrow.apache.org/blog/2024/01/19/datafusi…
alamb updated
3 weeks ago
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
I would like to use substrait with physical plans. I plan on having an initial PR up this wee…
-
The Python bindings currently only expose a subset of functionality, and we want to expose as much as possible.
Here is a list of all available rust methods. Note that there may be reasons why we …