-
I am inferencing XGB model with spark_apply, it seems quickly complete all inference task, but hold on for many hours (10+ hours) at last task.
My input dataset is very large, up to 2 billion user …
-
My spark cluster works fine: version 2.1.0
But I am having issue with sparklyr 0.9.
I tried all modes (local, yarn-client and yarn-cluster) but no luck.
Error below:
Error in force(code) :
…
-
Providing us with the observed and expected behavior definitely helps. Giving us with the following information definitively helps:
- Sparkling Water/PySparkling/RSparkling version
- ai.h2o:sparkl…
-
Dev activity on missing features has been slow since the public announcement mid last year.
The C++ library as basis for a fast connection to Impala from Python and R seems indeed very promising.
Is…
-
Hi,
I am unsuccessful in writing to a table in Databricks Unity Catalog. I can easily read data from catalogs/schemas. I am using Python Databricks connect. I receive the same result regardless of …
-
It sometimes happens that one wants the next/previous **non-missing** value. It would be nice if this was supported in lead/lag functions, e.g., through `na.rm` argument.
This is also specified in S…
-
Hi. I'm working with the geospark sparklyr extension with huge spatial datasets (mostly points datasets).
When I need to compute a geospatial index like H3, I have to use spark_apply() with the R h3…
-
Running
``` r
library(sparksf)
library(sparklyr)
sc
-
Hi there,
The .noexport value within the foreach function doesn't seem to work correctly when both the function to loop over and the call to foreach itself live within various functions. Please se…
-
# Reporting an Issue with sparklyr
For general programming questions with `sparklyr`, please ask on
[Stack Overflow](http://stackoverflow.com) instead.
Please briefly describe your problem and,…