-
### Description
- TFIDF notebook doesn't work
- Add test for notebook
- Add test for dataset
### In which platform does it happen?
### How do we replicate the issue?
…
-
Please fill out the form below.
### System Information
- **Spark or PySpark**: pyspark
- **SDK Version**: latest (pip version 1.4.2), EMR 5.23.0
- **Spark Version**: 2.4.0
- **Algorithm (e.g. K…
-
Hey,
I am trying to run pytest on my local PC, when databricks extension is installed.
I am trying to create local spark session:
```
def get_spark_session():
spark = (
SparkS…
-
Through performing steps on tutorial https://radanalytics.io/examples/pyspark_hdfs_notebook . I've created instance with hadoop and configured hadoop single node as specified here https://hadoop.apach…
-
In the current version of Spark Connect, the Spark Connect implementations of the `Column` and `DataFrame` classes aren't inherited from the corresponding classes, so the following checks will fail:
…
-
### Current Behaviour
I'm making a very simple Spark dataframe with only one column. Apparently, ProfileReport does not generate the report when I am using Databricks notebook.:
Below is the code th…
-
## Expected Behavior
DBLDATAGEN should be able to run with Databricks Serverless Instances as `spark.sql.execution.arrow.pyspark.enabled` is set by default.
## Current Behavior
Module fails as it…
-
As reported by @dmvieira in [this comment](https://github.com/mozilla/jupyter-spark/issues/25#issuecomment-305505778), the plugin doesn't work correctly if there are multiple notebook kernels running …
-
**Describe the bug**
Our codebase contains classloading-sensitve code such as
- code to support external sources such as Avro.
- Shim loading from parallel worlds
Classloader architecture in RE…
-
This looks neat, but the test notebook only shows example usage with pyspark. Does this work at all with Scala notebooks?