-
### Is your feature request related to a problem or challenge?
Part of https://github.com/apache/datafusion/issues/11752
StringView is a new arrow array type that allows for more efficient string …
alamb updated 3 weeks ago
-
**Is your feature request related to a ~problem or~ challenge? Please describe what you are trying to do.**
We have been using at least two parquet writers that both utilize the low-level APIs prov…
-
Hello, I have defined a custom grammar using Antlr4 (4.13.2), but when parsing with the generated parser I found that each parse costs roughly 50 ms. Our project can only all…
-
This is an umbrella ticket for adding Join support to Comet. In Spark, there are basically three types of Join operators: BroadcastJoin, HashJoin, SortMergeJoin. In DataFusion, two Join operators are …
-
# Environment
--------Version info---------
Polars: 1.6.0
Index type: UInt32
Platform: macOS-14.5-arm64-arm-64bit
Python: 3.12.3 (main, Jun 6 2024…
-
**Is your feature request related to a problem? Please describe.**
If a plan contains nested unions/concats, we can instead flatten those to a single operation
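
The flattening idea can be sketched as follows. This is a minimal illustration, not the project's actual optimizer: the `Scan` and `Concat` classes and the `flatten_concat` helper are hypothetical names invented here, and the sketch assumes the concat is order-preserving and associative (as with a plain `UNION ALL`), so merging nested inputs into the parent does not change the result.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Scan:
    """Hypothetical leaf plan node that reads a named source."""
    name: str


@dataclass
class Concat:
    """Hypothetical plan node that concatenates its inputs in order."""
    inputs: List["Plan"] = field(default_factory=list)


Plan = Union[Scan, Concat]


def flatten_concat(plan: Plan) -> Plan:
    """Collapse nested Concat nodes into a single Concat, keeping input order."""
    if not isinstance(plan, Concat):
        return plan
    flat: List[Plan] = []
    for child in plan.inputs:
        child = flatten_concat(child)  # flatten bottom-up first
        if isinstance(child, Concat):
            flat.extend(child.inputs)  # splice the nested inputs into the parent
        else:
            flat.append(child)
    return Concat(flat)


# A three-level nested concat becomes one concat over four scans.
nested = Concat([Scan("a"), Concat([Scan("b"), Concat([Scan("c"), Scan("d")])])])
flat = flatten_concat(nested)
```

A rewrite like this would not be safe for a deduplicating union unless every nested level deduplicates in the same way, so a real rule would need to check the node's semantics before merging.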
Example:
```py
df.concat(df.conca…
```
-
### SDK
Python
### Description
Once we upgrade to DataFusion 38, we will have access to the `named_struct` SQL function allowing us to create struct literals with names [^1][^2]. This should …
-
**What**
Alongside ClickBench, the most popular OLAP benchmarking framework is probably TPC-H. It's a much older benchmark, and is trusted by the older enterprises.
DataFusion already has native s…
-
### LanceDB version
0.5.2 @lancedb/lancedb
### What happened?
```
Error: lance error: LanceError(IO): Execution error: Row ids did not arrive in sorted order: integers are ordered up to the 0th el…
```
-
### Is your feature request related to a problem or challenge?
DataFusion: v41.0.0
I want to be able to write the following query as a prepared statement:
```sql
PREPARE get_N_rand_ints_from…
```