rjurney / Agile_Data_Code_2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
http://bit.ly/agile_data_science
MIT License
456 stars 306 forks source link

fix versions for spark jar path #88

Closed pjhinton closed 5 years ago

pjhinton commented 5 years ago

In the bootstrap.sh script, the following JAR file had changes in the versions that are retrieved from the web.

mongo-java-driver (3.4.2 -> 3.6.1) elasticsearch-spark (20_2.10-5.2.1 -> 20_2.11-6.1.2)

However, the echo command that writes JAR file names to the spark configuration file still has the old versions. This commit updates those to prevent exceptions being thrown when pyspark is being launched.

Might be worth setting some variables in the script that track what versions are being used so that having to edit the script multiple places in the future is avoided.