-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
Upstream in DataFusion, there is a common common pattern where we have multiple input `Record…
-
I cannot get benchmarks running in k8s. I suspect that too many tasks are being scheduled in parallel.
I added resource constraints in the code:
```python
@ray.remote(num_cpus=1)
def execute_q…
-
### Is your feature request related to a problem or challenge?
This is a crazy idea
Now that DataFusion is the fastest engine for Parquet in ClickBench
- https://github.com/apache/datafusion/i…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
As other DataFrame libraries start moving to leveraging GPU resources, it would be useful t…
-
### Describe the bug
## SQL
```
SELECT c8, length(c8) AS x FROM test0 ORDER BY c8;
```
### Spark Plan
```
AdaptiveSparkPlan isFinalPlan=true
+- == Final Plan ==
*(2) Sort [c8#8 ASC NULLS F…
-
### What is the problem the feature request solves?
During the Comet community call this morning, we discussed some options around how we keep up to date with changes in DataFusion core.
There a…
-
Could anybody point me to any docs on how to use/register datafusion-contrib projects through the python bindings?
thanks!
-
ORC spec: https://orc.apache.org/docs/types.html
> Hive always uses a struct with a field for each of the top-level columns as the root object type, but that is not required
See https://github.c…
-
### What is the problem the feature request solves?
Comet has native code for decoding Parquet structures into Arrow arrays. This issue is for discussing delegating to the [parquet crate](https://cra…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
Sometimes it is desired to convert RecordBatches from one schema so they match another. Thi…