-
**Describe the problem you faced**
* Hudi 0.14.1
* Enabled Metadata Table + Enabled Bloom filter Indexing
* When enabling "hoodie.bloom.index.use.metadata=true" to use the Bloom filter Indexing i…
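
For readers trying to reproduce this setup, the options named in the list above can be gathered in one place. This is a minimal sketch, not the reporter's actual job configuration: only `hoodie.bloom.index.use.metadata=true` is quoted in the report. The other three keys are assumptions about what "Enabled Metadata Table" and "Enabled Bloom filter Indexing" correspond to, and `with_index_options` and `example_table` are hypothetical names used for illustration.

```python
# Hudi write options mirroring the setup described in the report.
# Only the last key is quoted in the report; the others are assumptions
# about what "Metadata Table" and "Bloom filter Indexing" being enabled means.
hudi_index_options = {
    "hoodie.metadata.enable": "true",                     # assumption: metadata table on
    "hoodie.index.type": "BLOOM",                         # assumption: Bloom index type
    "hoodie.metadata.index.bloom.filter.enable": "true",  # assumption: Bloom filter partition in metadata table
    "hoodie.bloom.index.use.metadata": "true",            # quoted in the report
}


def with_index_options(base_options):
    """Return a copy of base_options with the index options layered on top."""
    merged = dict(base_options)
    merged.update(hudi_index_options)
    return merged


# "example_table" is a placeholder table name, not taken from the report.
write_options = with_index_options({"hoodie.table.name": "example_table"})
```

In a Spark job, a dictionary like this would typically be passed as `df.write.format("hudi").options(**write_options)`. Keeping the index options in one mapping makes it easy to toggle `hoodie.bloom.index.use.metadata` when comparing runs.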
-
### Apache Iceberg version
None
### Query engine
None
### Please describe the bug 🐞
We were upgrading to Spark 3.4.1 when we ran into this issue. We are currently running on Spark 3.2.1, which works. We'…
-
**Describe the problem you faced**
Hi Folks. I'm trying to get some advice here on how to better deal with a large upsert dataset.
The data has a very wide key space and no great/obvious partiti…
-
EMR - 5.13.0
Spark - 2.3.0
Running on m5.2xlarge 1 Master and 2 Worker nodes.
While running Haplotype Caller Spark on EMR, I ran into a stack overflow error.
`[hadoop@ip-xx.xx.xx.xx gatk]$ …
-
**Describe the bug**
Execution fails with the following error message in the UI:
```
Buffer overflow. Available: 0, required: 80893
Serialization trace:
underlying (org.apache.spark.util.BoundedPr…
-
I'm working on an Android project with Paper (using the rxpaper2 wrapper). I have some object data with Time members.
I saw in the Kryo code that such a serializer should be added by default if Java8 i…
-
**Describe the problem you faced**
I'm getting the following error when trying to run a Spark job that reads and upserts a large amount of data into a Hudi table.
```
org.apache.hudi.com.esoterics…
-
```
When trying to serialize a list with 1900 elements (simple beans which have
some string, int and date props) - total size of 1.5 MB, memory consumption
will exceed 2 GB and java OOM error will o…
-
[ERROR 2018-11-05 15:22:13 c.a.j.c.AsyncLoopRunnable:116 b-11-aggregate-stat-calc-aggregate-stat-all-agg-stat-ana-round-machine-agg-stat-ana-round-test-agg-stat-ana-round:33-TransferRunnable-0] Async …