Closed chezou closed 6 years ago
It works fine with the packages = FALSE option:
spark_apply(
  iris_tbl,
  function(e) broom::tidy(lm(Petal_Length ~ Petal_Width, e)),
  names = c("term", "estimate", "std.error", "statistic", "p.value"),
  group_by = "Species",
  packages = FALSE)
# Source: table<sparklyr_tmp_1a4ec6a42> [?? x 6]
# Database: spark_connection
  Species    term         estimate std.error statistic      p.value
  <chr>      <chr>           <dbl>     <dbl>     <dbl>        <dbl>
1 versicolor (Intercept) 1.7812754 0.2838234  6.276000 9.484134e-08
2 versicolor Petal_Width 1.8693247 0.2117495  8.827999 1.271916e-11
3 virginica  (Intercept) 4.2406526 0.5612870  7.555230 1.041600e-09
4 virginica  Petal_Width 0.6472593 0.2745804  2.357267 2.253577e-02
5 setosa     (Intercept) 1.3275634 0.0599594 22.141037 7.676120e-27
6 setosa     Petal_Width 0.5464903 0.2243924  2.435422 1.863892e-02
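For anyone who wants to sanity-check these numbers without a Spark connection, here is a minimal local sketch in base R. It is only an assumption about how one might cross-check the result, not part of the sparklyr workflow above: it uses the built-in iris data frame (where the columns are named Petal.Length and Petal.Width, with dots rather than underscores) and coef(summary(lm(...))) in place of broom::tidy. The name fits is made up for illustration.

```r
# Local, Spark-free cross-check of the per-species regression.
# Assumption: the built-in iris data frame stands in for iris_tbl;
# its columns use dots (Petal.Length), not underscores.
fits <- lapply(split(iris, iris$Species), function(e) {
  # Coefficient matrix: Estimate, Std. Error, t value, Pr(>|t|)
  coef(summary(lm(Petal.Length ~ Petal.Width, data = e)))
})

# Inspect one group; the estimates should line up with the setosa rows above.
fits$setosa
```

Each element of fits is a matrix with one row per term, so it can be compared row by row with the term, estimate, std.error, statistic and p.value columns returned by spark_apply.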
I tried to run broom::tidy with CDSW, but it didn't work because libicui18n.so.55 is missing. It works in a Docker container and with conda when I create the parcel.