CoorpAcademy / docker-pyspark

Docker image of Apache Spark with its Python interface, pyspark.
40 stars 14 forks source link

New spark version based on alpine linux #9

Closed sarahmonod closed 8 years ago

sarahmonod commented 8 years ago

This PR fixes #5 and #6, with a completely different approach, based on alpine linux.

Note that the overall size has also been drastically reduced (about 2x smaller).

sarahmonod commented 8 years ago

There is an issue with PYSPARK_PYTHON, as it is set to python34 instead of python3.4.