-
**Describe the bug**
When using Apache Spark on Amazon EMR, you have the ability to replace Apache Hive with AWS Glue as the Hive-compatible metastore, with the underlying data being stored in S3. Wh…
-
See similar git issues here:--
https://github.com/tensorflow/ecosystem/issues/61#issuecomment-363577011
https://github.com/tensorflow/ecosystem/issues/61
https://github.com/tensorflow/ecosystem/iss…
-
Hello, I am following the command docker run -v $(pwd):/job godatadriven/pyspark /job/samples/word_counter.py with my own python script and am getting this error:
Error: No main class set in JAR; ple…
-
> I have uses XLX file and 3 parqute files as source and performed some teansformation. the code ran good and i could able to see the linegae in spline. but i could able see only 3 parqute files as s…
wajda updated
3 months ago
-
After running the WriteJasonFile function in cell 12 of the Chapter 2: Designing Databricks Day One/Project: Streaming Transactions/CH2-01-Generating Records Using DBKS Labs Datagen.py notebook I get …
-
f = sqlContext.read.options(catalog=catalog).format("org.apache.spark.sql.execution.datasources.hbase").load()
File "/usr/hdp/2.5.3.16-1/spark/python/lib/pyspark.zip/pyspark/sql/readwriter.py", lin…
-
I was trying to run the test on my local spark but the code is not working. I've pasted the exact code which I ran down below and it breaks at the last line, `compare_results(expected, output, decimal…
-
Spark ML Lib offers both Multinomial and Bernoulli options for Naive Bayes according to this: https://spark.apache.org/docs/1.5.2/mllib-naive-bayes.html
We currently only offer the one option.
w2sgb updated
4 years ago
-
### Search before asking
- [X] I searched in the [issues](https://github.com/apache/incubator-paimon/issues) and found nothing similar.
### Motivation
Docs about s3 storage is fuzzy, need a exampl…
-
These would save the user from having to import `pyspark`, and could also set up `SparkConf` for you. Probably mostly matters for the inline runner (see #1965).