-
**Alluxio Version:**
2.9.1
**Describe the bug**
when the alluxio disk space is occupied, the alluxio can't free disk space. and this is my config:
`-Dalluxio.master.journal.type=EMBEDDED -Dallux…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Enhancement description
AWS supports Firehose dynamic partitioning of S3 prefixes by using keys in the streami…
-
**Bug Report:**
**Description:** Spark Scala streaming application reads dataset from EventHub and writes processed dataset to ADLS Gen2, that part of the application (without hadoop configuration)…
-
* Hudi version :0.13.1
* Flink version :1.13
Hudi Flink Config:
'connector' = 'hudi',
'path' = 's3://bnb-datalake-hudi/**********',
'table.type' = 'COPY_ON_WRITE', 'write.batch.size' = '5…
-
I'm trying to cdc data in upsert mode from Postgres. I notice when I partition the iceberg table by a column present in the source table, new records are appended instead of upserted.
Here is the s…
-
Does/will this support Delta Lake?
-
we have a glue streaming job that writes to hudi table, we try to do schema evolution, when we add a new col to any record, it works fine and the new col is shown when querying the table, the thing is…
-
- I am using debezium-server-iceberg to CDC data from RDBMS to iceberg table. I have run successfully and saved data into Minio with iceberg format.
- However, I am facing small datafiles problems. …
-
### Bug description
Trying to train using the ddp_notebook strategy and data stored in MDS format, I get the error above with the stack trace below.
### What version are you seeing the problem…
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. It would be nice to have [...]
Hi, I have a question about ingestion …