-
Enabling the hudi cleaner service (sync or async) throws an error when trying to upsert a record in spark append mode.
**To Reproduce**
I am running the following script in the Jupyter notebook:…
-
## Bug
### Describe the problem
Reading from a Delta table with a large Delta log takes a long time using `S3SingleDriverLogStore`. When calculating a snapshot of a Delta table `S3SingleDriverLogS…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? Yes
- Join the mailing list to engage in conversations and get faster support at dev-sub…
-
hudi 0.10.1 / aws glue metastore
so far, glue does not suport the syntax used by hudi to make schema evolution `alter table replace colums ... cascade`
As a result, when a hudi table schema evol…
-
This is the official for discussing support for Delta Lake on S3 while writing from multiple clusters. The challenges of S3 support have been explained in #39 . While #39 tracks the work for a simpler…
-
Python: 3.7.10
pip: 20.2.2
cryptography: 36.0.2
Spark: 3.1.2-amzn-1
if I load the cryptography from --pyfiles, the error raised:
```text
[hadoop@ip-172-31-21-108 ~]$ pyspark --py-files depen…
-
I'm not sure if #1183 is related; there's something about S3 there.
Environment:
- AWS EMR emr-6.2.0
- Spark 3.0.1
- TF 2.4.0
- TF IO 0.17.0
My use-case is, I have Parquet files stored in AW…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help…
-
**Describe the problem you faced**
Trying to run hudi with spark 3.0.0, and getting an error
**To Reproduce**
Steps to reproduce the behavior:
**Expected behavior**
**Environment Descri…
-
# sql1
```
select
account_channel,
count(*)
FROM hive.lb_bi_finance.dws_ipo_profit_and_loss_analysis_d
WHERE
pt IN ('2022…