-
I used Spark 3.4.1 and Hudi 0.14.0 on GKE, streaming reads from one Hudi COW table (on GCS) and writing to another Hudi COW table (on GCS) with upsert (RECORD_INDEX). Here are my write options:
write_streaming_h…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/doris/issues?q=is%3Aissue) and found no similar issues.
### Version
hadoop@olap-test2:~/xingying01/do…
-
**Describe the problem you faced**
I use Flink to write a Hudi COW table and sync it to Hive, but Hive aggregate queries (e.g. count(*), row_number() over()) return duplicate data, while select * did not…
-
Some geospatial formats, such as [flatgeobuf](https://github.com/flatgeobuf/flatgeobuf#specification), have started to include a spatial index. I think it would be great to include one as an optional feature of GeoParquet.
…
-
I am trying to load data from YugabyteDB that is streamed to Kafka. I am using the Hoodie Sink connector to sink the data to a Hudi table and am getting the following error.
[2023-11-19 14:20:22,236] WARN […
-
Hi Team,
I am facing an issue with duplicate record keys when upserting data into Hudi on EMR.
Hudi Jar -
hudi-spark3.1.2-bundle_2.12-0.10.1.jar
EMR Version -
emr-6.5.0
Workflow -
files…
-
Started with a fresh container. Ran the same scenario with Apache Hudi and didn't get this issue.
```
atwong@Alberts-MBP-3 sandbox % docker run -p 9030:9030 -p 8030:8030 -p 8040:8040 -itd --na…
```
-
Hello, I'm currently experimenting with the Hudi delta streamer and working on creating part 12 of the delta streamer playlist. For the next video, my goal is to cover the Hudi SQL-based transformer a…
-
Previous [Release Note 2.0.4](https://github.com/apache/doris/issues/29906)
Thanks to our community users and developers, about 217 improvements and bug fixes have been made in Doris 2.0.5 versio…
-
The clustering/compaction job throws the following exception; the final result returns -1, but the job's state is success.
ERROR UtilHelpers: Cluster failed
org.apache.spark.SparkException: Job aborted due to …