-
Make use of file statistics, stripe statistics, column statistics, row group indexes, and bloom filters
Need a way to expose this functionality so users (like datafusion) can utilize it to efficiently q…
-
### Describe the bug
You can trivially cause datafusion to use any amount of memory by simply running
```sql
select generate_series(9876543210);
```
Memory management functionality, e.g. [`Me…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
We still have a few classes that do not yet have wrapper functions. Namely `datafusion.obje…
-
### Is your feature request related to a problem or challenge?
Hi all
I was going through the SMJ implementation and stumbled upon these comments
```
// Sort-Merge joi…
-
### What is the problem the feature request solves?
DataFusion HashJoin LeftAnti doesn't support null-aware anti join.
See https://github.com/apache/datafusion/issues/10583
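
For readers unfamiliar with the term, here is a minimal sketch of how a null-aware anti join (the semantics of `x NOT IN (subquery)`) differs from an ordinary left anti join under SQL three-valued logic. It is plain Python over lists of optional integers, not DataFusion code; the function names `null_aware_left_anti` and `plain_left_anti` are hypothetical and exist only for illustration.

```python
from typing import Iterable, List, Optional


def null_aware_left_anti(
    left: Iterable[Optional[int]], right: Iterable[Optional[int]]
) -> List[Optional[int]]:
    """Keep left rows for which `x NOT IN (right)` evaluates to TRUE."""
    right_values = list(right)
    if not right_values:
        # NOT IN over an empty set is TRUE for every row, including NULL.
        return list(left)
    if any(v is None for v in right_values):
        # With a NULL on the right, each row either matches or compares
        # as UNKNOWN, so no row evaluates to TRUE.
        return []
    keys = set(right_values)
    # A NULL on the left compares as UNKNOWN against a non-empty right side.
    return [x for x in left if x is not None and x not in keys]


def plain_left_anti(
    left: Iterable[Optional[int]], right: Iterable[Optional[int]]
) -> List[Optional[int]]:
    """Ordinary anti join: keep left rows that have no equal right row."""
    # NULL never equals anything, so NULL keys on the right match nothing
    # and NULL rows on the left are always kept.
    keys = {v for v in right if v is not None}
    return [x for x in left if x is None or x not in keys]


if __name__ == "__main__":
    left_rows = [1, 2, None]
    right_rows = [2, None]
    print(null_aware_left_anti(left_rows, right_rows))
    print(plain_left_anti(left_rows, right_rows))
```

The two functions only disagree when a NULL is present on either side, which is why a hash join that ignores NULL keys cannot be used as-is for `NOT IN`.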
### Describe the poten…
-
# Background and Motivation
Currently, we have an in-house parser that is built on the `lalrpop` parser-generator. This has been good while the supported syntax has been simple. However, as the suppo…
-
# Description
The Rust writer in its current state keeps a buffer instead of streaming to disk, which causes the writer to use quite a lot of extra memory.
We need to address this performance issue.
@wjones1…
-
### Is your feature request related to a problem or challenge?
While reviewing https://github.com/apache/datafusion/pull/11943 from @Rachelint it is becoming clear to me that the hash aggregate co…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
In addition to the information we already have in the online documentation, it would be hel…
-
Part of https://github.com/apache/datafusion/issues/11752 and https://github.com/apache/datafusion/issues/11790
Currently, a call to `REGEXP_MATCH` with a Utf8View datatype induces a cast. After …