-
https://github.com/apache/parquet-java/blob/master/parquet-column/src/test/java/org/apache/parquet/column/impl/TestColumnReaderImpl.java#L55
-
> [!NOTE]
> _Originally posted by @dangotbanned in https://github.com/vega/altair/pull/3631#discussion_r1848380570_
> I thought this would be easier to discuss in a new issue over here
> Let me know …
-
### Bug description
Write parquet file requires disable gluten.
```
spark.sql("set spark.gluten.enabled=false")
spark.range(1000).selectExpr("id%2 as c1", "id%5 as c2", "id as c3").write.mode("o…
-
**Description of Bug:**
When downloading a large file through boto3, the completed file is unexpectedly large and corrupt.
**Steps to Reproduce:**
- Download a large file directly: `curl -O htt…
-
### Describe the bug, including details regarding any error messages, version, and platform.
LargeBinary and LargeString use int64 offsets, however Binary and String types use int32 offsets, this mak…
-
### Search before asking
- [X] I searched in the [issues](https://github.com/apache/paimon/issues) and found nothing similar.
### Paimon version
paimon-1.0-SNAPSHOT
### Compute Engine
flink
###…
-
### Describe the enhancement requested
When reading some column chunks but not all, parquet is building a list of "ConsecutivePartList", then trying to call the Hadoop api for vectorized reader of FS…
-
### Feature Request / Improvement
Hi all:
When using Amazon S3 object storage with Iceberg, there can be a throttling issue for the same path. By setting the parameter write.object-storage.enabl…
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
It is a pretty common usecase for systems (e.g. InfluxDB 3.0) to ingest data and write it a…
alamb updated
2 weeks ago
-
### Search before asking
- [X] I searched in the [issues](https://github.com/apache/paimon/issues) and found nothing similar.
### Motivation
NestedColumnReader has performance regression for simpl…