-
I am using Azure Databricks and I am trying to read an Excel file (xlsx) from a Storage account (ADLS Gen2). I have the accountKey set up in the cluster config. In the storage account I have a folder a…
-
I am trying to compile the latest (1.6.0rc8 / git master -> PARQUET-214: Fix Avro string regression.) version of parquet-mr. The required versions of protobuf (2.5.0) and thrift (0.7) are installed an…
-
When I follow the guide and run "hadoop jar $JAR $CLASS --input $INPUT --output $OUTPUT", I get the following error:
"
java.io.IOException: Error opening job jar: ./target/spotify-hdfs2cass-2.0-SNAPSHOT-jar-w…
-
I'm not able to save the RDD to DynamoDB using PySpark.
EMR 5.10
pyspark --jars /usr/share/aws/emr/ddb/lib/emr-ddb-hadoop.jar
I am able to read the DynamoDB data using the code below:
…
-
**_Tips before filing an issue_**
- Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)?
- Join the mailing list to engage in conversations and get faster support at dev-subscribe@h…
-
Background:
We scaled the Presto cluster from 4 to 32 machines. The added machines are co-located with HDFS (the first 4 are deployed independently). Since then, there have been several errors when reading from HDFS.
presto ve…
-
When I query a Hudi table on a Kerberos cluster, it reports "GSS initiate failed".
```
Query 20220112_032437_00002_tmdm5 failed: Error checking path :hdfs://emr-master-1:8020/warehouse/tablespac…
```
-
Hi,
I have an EKS cluster with an EBS storage class/volume.
I am able to deploy the HDFS namenode and datanode images (bde2020/hadoop-xxx) successfully using a StatefulSet.
When I try to put a file to…
-
Likely an analogous problem to #1417.
```
./gatk-launch MarkDuplicatesSpark -I file:///home/unix/louisb/flag_stat.bam -O file:///home/unix/louisb/testoutput.bam -- --sparkRunner SPARK --sparkMaster y…
```
-
Hi,
I was able to follow your instructions, but I am getting an error at this step: `hdfs dfs -copyFromLocal ./test.txt /test.txt`
```
root@bc548e4d1633:/home/big_data# echo "test" > test.txt
…
```