-
### Describe the bug
## SQL
```
SELECT c8, length(c8) AS x FROM test0 ORDER BY c8;
```
### Spark Plan
```
AdaptiveSparkPlan isFinalPlan=true
+- == Final Plan ==
*(2) Sort [c8#8 ASC NULLS F…
-
### Describe the enhancement requested
hi there, as per https://github.com/pandas-dev/pandas/issues/58799#issuecomment-2128818332 i took it that pyarrow does not support writing parquet enum types ye…
-
on a partitioned table
```sql
CREATE TABLE iceberg.toto.a (
id uuid NOT NULL,
created_at timestamp(6) NOT NULL,
a varchar NOT NULL,
)
WITH (
format = 'PARQUET',
format_version = 2,
…
-
### Feature description
I'd like to be able to load STAC items that have bands referenced as distinct geotiffs. The current GTI docs indicate it is limited to using a `location` field that points to …
-
Hello
I tried to 'export_data' with Parquet file, which source is the large(2GB) size CSV, to SBDF.
When I tried with C library, I used 'tableslice' as recommended and it works fine.
But in Pytho…
-
I use presto to read Parquet file in HDFS. The parquet file has enable Parquet modular encryption.
Reading small file is fine, but while reading large file, it fail at the decrypt function.
Presto s…
-
I can download data up to 2024-05-31, but there seems to be no data available after this timepoint. Is this correct/intended, and did the data move anywhere else?
Or is it my error.
Example:
```
…
-
In https://github.com/zarr-developers/VirtualiZarr/pull/251#discussion_r1802972887, we've been discussing the detection of the parquet reference files.
This turns out not to be easy: while the dire…
-
### Describe the enhancement requested
In c++ implementation we have `ArrowReaderProperties.set_arrow_extensions_enabled` to enable this behavior.
See [comment](https://github.com/apache/arrow/pull/…
-
Currently the project use pyspark which depends on Spark and Hadoop, dependencies that do not natively run on Windows and make the script not Windows friendly, see #13 for the details. It seems that p…