nlesc-sherlock / emma

Ansible playbook to create a cluster with GlusterFS, Docker, Spark and JupyterHub services
Apache License 2.0
3 stars 4 forks source link

Applications #49

Closed romulogoncalves closed 7 years ago

romulogoncalves commented 7 years ago

We were now able to read a Geotiff with Geotellis, extract a band and run kmeans from SparkMlib on it. PySpark is also working now, however, Geotrellis is not available in PySpark because the module available is too old.

We also added information in how to use HDFS web-ui so the user can navigate over the files stored HDFS and for example delete results so it can repeat the job.