-
Hi,
We have setup nutch on amazon EMR using below tutorials
https://github.com/eleflow/nutch-aws
The problem is we are not able to copy crawl file from EMR to bucket.
Can anyone please help how t…
-
**Describe the problem you faced**
Async Cleaner OOM / slowdown after creating a large Savepoint
**To Reproduce**
Steps to reproduce the behavior:
1. Create a large savepoint e.g. 2GB,
2.…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscri…
-
The situation has been getting better wrt Spark jobs running on spot instances in EMR recently (https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-configure.html) so it might be interesting…
-
The situation has been getting better wrt Spark jobs running on spot instances in EMR recently (https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-configure.html) so it might be interesting…
-
```
What steps will reproduce the problem?
1. Load EMR
2. setCredentials(xxx,xxx)
3. emr.test
-
* Hudi version :0.13.1
* Flink version :1.13
Hudi Flink Config:
'connector' = 'hudi',
'path' = 's3://bnb-datalake-hudi/**********',
'table.type' = 'COPY_ON_WRITE', 'write.batch.size' = '5…
-
```
What steps will reproduce the problem?
1. Load EMR
2. setCredentials(xxx,xxx)
3. emr.test
-
Spark 2.4.4 version
emr release 5.29
Due to some constraint, i can not use port number 18080 for spark history server. Spark history server points to other than 18080(example: 18480).
I am gett…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the com…