Open ash211 opened 7 years ago
Thanks @ash211 - For reference and comparison, I built the 2.2 release using the following:
$ ./dev/make-distribution.sh --pip --tgz -Pmesos -Pyarn -Pkinesis-asl -Phive -Phive-thriftserver -Pkubernetes -Phadoop-2.7 -Dhadoop.version=2.7.3
$ ./dev/make-distribution.sh --pip --tgz -Pmesos -Pyarn -Pkinesis-asl -Phive -Phive-thriftserver -Pkubernetes -Phadoop-2.7 -Dhadoop.version=2.7.3 -Phadoop-provided
I noticed that all but one unit test succeeded using only -Pkubernetes -Phadoop-2.7 -Dhadoop.version=2.7.3
.
I was unable to run integration tests out of the box because I use kvm instead of v-box for minikube. However I can probably spin minikube up separately and run them in shared-cluster mode, I need to test.
to build w/out hadoop deps included you have to use -Phadoop-provided
which I'm not sure everybody knows. Simply not specifying -Phadoop-x.x
defaults it to 2.2, it does not build without.
cc @erikerlandson
For reference, I use this script to build the Palantir Spark dist locally which might be usable as a starting point: