-
### Description
Polars currently is the best dataframe experience for batch processing, it would be worth considering whether it'd be possible to support stream processing.
Some prior literature…
-
Can this be used to transform sql -> datafusion logical query plan in C++?
-
### env:
```
host
os:CentOS Linux release 8.2.2004 (Core)
kernel:4.18.0-193.el8.x86_64
container:
os:CentOS Linux release 7.4.1708 (Core)
kernel:4.18.0-193.el8.x86_64
cpu cores:128
spark: 3.4…
-
### Skill Name
Delta ecosystem icons
### Why?
I don't know how to create icons, I don't use ilustrator or figma. Please let me know how can I help to make this happen. what do you need?
It will …
-
### Is your feature request related to a problem or challenge?
If we want to have DataFusion used as the core of many new systems, we need it to be as easy as possible for someone to get their idea…
alamb updated
1 month ago
-
### Is your feature request related to a problem or challenge?
The last roadmap discussion we had seems to have worked out well to galvanize and get us organized around some common goals
- https://g…
-
In https://github.com/apache/arrow-datafusion/pull/3380 @thinkharderdev added support for evaluating filters during the parquet scan via the RowIndex mechanism 🎉
This feature is currently enabled…
alamb updated
1 month ago
-
### What type of enhancement is this?
API improvement
### What does the enhancement do?
The error messages in GreptimeDB have some issues:
* Errors from DataFusion always appear as interna…
-
Differences between rayexec & datafusion:
- In house makes feature dev easier
- rayexec pushed-base
- Arrow library ergonimcs
- Performance
Sean to expand
-
# Call to action:
Let's invest more effort in DataFusion benchmarking, both as a mechanism for technical evangelism as well as a guide for actual performance improvements.
# Background
We ha…
alamb updated
5 months ago