-
When I connect spark by sparklyr, it works well.
> library(sparklyr)
> library(dplyr)
Attaching package: ‘dplyr’
The following objects are masked from ‘package:stats’:
filter, lag
Th…
-
Sparklyr has the very awesome position of starting an internal spark subprocess - which is different to all other spark interfaces out there today. This makes it possible to control all spark-configur…
-
Hi, I'm using sparklyr on Databricks DBR 14+, when i would like to arrange in descending order an error occurs, ascending is fine.
Here a small example:
```r
library(magrittr)
sc %
dplyr…
-
I am using `urltools` with sparklyr and the computation is unfortunately pretty slow. I wonder if the x3 slower computation with `suffix_extract` is expected? Can I improve somehow its speed?
Thank…
-
Dear sparklyr team,
I just stumbled upon the difficulties with setting up BLAS / FORTRAN acceleration on windows.
I think it would be a nice to have, if sparklyr would provide respective binarie…
-
sparklyr seems confused when a ``val`` is implemented on both companion object and respective class.
Here a minimal working example. Implementation on scala side:
```scala
package test
object …
-
@MilesMcBain mentioned that it is difficult to work with missing data within spark.
For example:
```r
library(sparklyr)
library(tibble)
library(dplyr)
dat
-
Hi everyone,
I've seen a lot of issues/questions around the below error:
Script used:
```
library(sparklyr)
Sys.setenv(SPARK_HOME='/usr/hdp/current/spark2-client')
config
-
I have a very simple query to grab two fields from a data 1) date 2) ID
```r
q
-
The current maze of if/else statements used to setup the various connections to the different CloudObjectStorage services is not so appealing or future-compatible.
https://github.com/ibm-watson-dat…