-
### Describe the enhancement requested
In our standard ( https://github.com/apache/parquet-format/blob/master/Encodings.md#dictionary-encoding-plain_dictionary--2-and-rle_dictionary--8 ) , the dict f…
-
trying this code
```
import glaredb
import pandas as pd
df = pd.DataFrame(
{
"A": [1, 2, 3, 4, 5],
"fruits": ["banana", "banana", "apple", "apple", "banana"],
}
)
…
-
### Feature description
the parquet data format is increasingly popular; existing GDAL-OGR code[0] relies on Apache Arrow libs to ingest parquet .
There exists a pure-python alternate `fastparq…
-
See previous discussions in mail thread: https://lists.apache.org/thread.html/re9896b902cddc0931e4efbdecf27203710fb87505b63e927eef7ea77%40%3Cdev.arrow.apache.org%3E
**Reporter**: [Hongze Zhang](https…
-
https://issues.apache.org/jira/browse/ARROW-10645
The idea is to port the original benchmarks from https://github.com/sunchao/parquet-rs/tree/master/benches in ervice of helping to get https://git…
-
The [Arrow C Data/Stream Interface](https://arrow.apache.org/docs/format/CDataInterface.html) is a relatively lightweight option for developers that want to expose Arrow Arrays to Python users.
It …
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
There were bunch of discussions that generated column names confusing, hard to read and hard …
-
**Current status**
1. Apache Arrow Java is compiled and tested by JDK 8, 11, 17, 18 but packaged with source/target JDK8
**Changes implemented by this PR** [https://github.com/apache/arrow/pull/…
-
**Describe the bug**
See PR description at https://github.com/apache/arrow-ballista/pull/340
**To Reproduce**
See PR description at https://github.com/apache/arrow-ballista/pull/340
**Expected…
-
### Is your feature request related to a problem or challenge?
While reviewing https://github.com/apache/arrow-datafusion/pull/7708 from @sarutak I spent time looking into what `plan_q` does and it i…