-
Hi,
I'm experimenting with using hadoop-connector and GCS as the backing filesystem for HBase and ran into some issues and wanted to know if they were known issues or not.
hadoop-connector versi…
-
Copying data from One cluster to Other Cluster
Teragen and Tersort Some Test:
Commands:
Teragen:
[hdfs@instance-1 ~]$ time yarn jar /opt/cloudera/parcels/CDH/lib/hadoop-0.20-mapreduce/hado…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscri…
-
Environment
AWS EMR Serverless 7.0.0
PySpark 3.5.0
XGBoost 2.0.3
I’m using XBoost for regression, specifically the SparkXGBRegressor. I’m able to use it without issues on my local machine. How…
-
The test items are: https://github.com/openucx/sparkucx
Test environment: We set up a Hadoop cluster with 2 machines. The RDMA network card model is Mellanox Technologies MT27800 Family [ConnectX-5]
…
-
## CVE-2017-15713 - Medium Severity Vulnerability
Vulnerable Library - hadoop-common-2.5.1.jar
Apache Hadoop Common
Path to dependency file: /foxtrot-sql/pom.xml
Path to vulnerable library: /home/ws…
-
Want to understand that this spark matrics repo will work with Prometheus in the Hadoop cluster?
Actually I have a use case where my java application is deployed in Hadoop cluster and we run the ap…
-
**Describe the problem you faced**
A flink write hudi job, we have hdfs jitter, cause flink task to fail over, and see this error
**To Reproduce**
Steps to reproduce the behavior:
*have ch…
-
I have set up docker swarm cluster , use the following configuration file to deploy hdfs cluster on the overlay network named test in my swarm cluster .
version: '3'
services:
namenode:
im…
-
Hi, Im using petastorm to feed tensorflow models lunched with spark in an EMR cluster. The code is the basic to read parquet files on s3:
```
from pyarrow import fs
from petastorm.reader import Rea…