rjurney / Agile_Data_Code_2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
http://bit.ly/agile_data_science
MIT License

Hadoop not installed on vagrant machine #55

Closed by JustinReboullot 6 years ago

JustinReboullot commented 7 years ago

After setting up the Vagrant machine, the ~/hadoop directory exists but is empty. This is probably because http://apache.osuosl.org/hadoop/common/hadoop-2.7.3/hadoop-2.7.3.tar.gz now returns a 404 error. The problem is fixed by executing the Hadoop part of the manual_install.sh script.

Note that I also had to execute this other line from manual_install.sh for Spark to work: export SPARK_DIST_CLASSPATH=`$HADOOP_HOME/bin/hadoop classpath`
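For anyone hitting the same symptom, here is a minimal sketch of a guard around that export. It is not taken from manual_install.sh: the `hadoop_is_installed` helper and the `$HOME/hadoop` default are assumptions for illustration. It exports the classpath only if the Hadoop launcher is actually present, so an empty ~/hadoop produces a clear message rather than a broken Spark setup.

```shell
#!/usr/bin/env bash
# Sketch only: the helper name and the default path are assumptions.
HADOOP_HOME="${HADOOP_HOME:-$HOME/hadoop}"

# Succeeds only if the hadoop launcher exists and is executable.
# An empty ~/hadoop (the symptom in this issue) fails this check.
hadoop_is_installed() {
  [ -x "$1/bin/hadoop" ]
}

if hadoop_is_installed "$HADOOP_HOME"; then
  # Same export as in the comment above, using $(...) for the substitution.
  export SPARK_DIST_CLASSPATH="$("$HADOOP_HOME/bin/hadoop" classpath)"
else
  echo "Hadoop not found in $HADOOP_HOME; rerun the Hadoop part of manual_install.sh" >&2
fi
```

The check looks for the launcher instead of the directory, because the directory exists even when the download failed.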

MaxCaughron commented 6 years ago

The above worked for me. I just replaced hadoop-2.7.3 with hadoop-2.7.4 in all the lines of the manual setup, and it worked on the VM box.
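Replacing the version on every line can be avoided by keeping it in one variable. The sketch below is an assumption about how that could look, not the repository's actual script: the function names and the `HADOOP_VERSION` default are hypothetical. It tries the mirror layout quoted in this issue first, then falls back to archive.apache.org, which keeps old releases after mirrors drop them.

```shell
#!/usr/bin/env bash
# Sketch only: function names and the default version are assumptions.
HADOOP_VERSION="${HADOOP_VERSION:-2.7.4}"

# Mirror URL in the same layout as the one quoted in this issue.
hadoop_mirror_url() {
  printf '%s\n' "http://apache.osuosl.org/hadoop/common/hadoop-$1/hadoop-$1.tar.gz"
}

# Apache archive URL for the same release.
hadoop_archive_url() {
  printf '%s\n' "https://archive.apache.org/dist/hadoop/common/hadoop-$1/hadoop-$1.tar.gz"
}

# Try each URL in turn. curl -f turns HTTP errors such as 404 into a
# non-zero exit status, so a dead mirror no longer leaves an empty directory
# without any error.
fetch_hadoop() {
  local version="$1" dest="$2" url
  local tarball="/tmp/hadoop-${version}.tar.gz"
  mkdir -p "$dest"
  for url in "$(hadoop_mirror_url "$version")" "$(hadoop_archive_url "$version")"; do
    if curl -fsSL "$url" -o "$tarball"; then
      tar -xzf "$tarball" -C "$dest" --strip-components=1
      return 0
    fi
  done
  echo "Could not download Hadoop ${version}" >&2
  return 1
}
```

Calling `fetch_hadoop "$HADOOP_VERSION" "$HOME/hadoop"` would then populate ~/hadoop, and a later version bump only touches the one variable.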

rjurney commented 6 years ago

Thanks, I'll fix this!

rjurney commented 6 years ago

I bumped the version to 2.8.2.