-
http://h2o-release.s3.amazonaws.com/sparkling-water/rel-2.2/2/index.html
Above link suggests the following to install pysparkling package:
pip install pysparkling_2.2
But it should be as below:…
-
In 3.30.0.6, if hc.stop() was used, spark context also seemed to have intermittent errors.
Restarting H2OContext with same spark cluster fails:
Launched SW: `./bin/pysparkling --num-executors 3 --exe…
-
Taking the example from the readme and appending `counts.toDF().toPandas()` fails with the titular error. Complete code:
```python
PS > pip freeze | grep spar
pysparkling==0.6.2
PS > py
Python …
-
The exception in subject is thrown by the following code:
```python
from datetime import date
from pysparkling.sql.session import SparkSession
from pysparkling.sql.functions import collect_set
…
-
When working in the PR #138, I encountered an issue:
```python
from datetime import date
from pysparkling.sql.session import SparkSession
from pysparkling.sql.functions import collect_set
spa…
-
XGBoost training with PySparkling fails on some datasets and succeeds on other very similar datasets. The H2O context crashes, then the Spark executors die of ConnectionRefusedErrors. The failure seem…
-
By default Spark is case insensitive.
Pysparkling does not mimic this as when you want to create a column "ID", you can't query "id".
> however we can make it case sensitive by using configurati…
-
Right now R package is bundling multiple datasets, but Python only one.
Please, explore way how to bundle more datasets in Python package and expose methods to access them/access the folder holdin…
-
https://github.com/h2oai/h2o-3/blob/c0f9ffef3b68e4727b1efe36c1c1111850519ee9/h2o-py/h2o/frame.py#L2542
requires both index and columns to be pivoted to be of following data types:
"enum","time",…
-
For Azure databricks, the Sparkling water flow ui gives an error `HTTP ERROR 500 java.lang.NoSuchMethodError: org.apache.spark.ui.UIUtils$.listingTable(Lscala/collection/Seq;Lscala/Function1;Lscala/co…