apache-spark-on-k8s / spark

Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apache/spark/
https://spark.apache.org/
Apache License 2.0

Include `-Pkubernetes-hdfs-extra` for `token-refresh-server` in CI setup. #529

Open kimoonkim opened 7 years ago

kimoonkim commented 7 years ago

@ifilonenko @liyinan926 @foxish @erikerlandson

PR #453 implements a new server, `token-refresh-server`, that renews the HDFS tokens used by long-running Spark jobs. The PR makes the server an optional component that is built only under a separate Maven profile, enabled with `-Pkubernetes-hdfs-extra`.

We should include the profile in the Jenkins CI setup after #453 is merged, so that subsequent PRs build the project and run the unit tests with the profile enabled. (Integration tests will be addressed later.)

For the main build, the command would look like the following (the new part is `-Pkubernetes-hdfs-extra`):

./build/mvn clean install -Pmesos -Pyarn -Phadoop-2.7 -Pkubernetes -Pkubernetes-hdfs-extra -Pkinesis-asl -Phive -Phive-thriftserver -DskipTests

The unit test command would look like this:

./build/mvn clean test -Pmesos -Pyarn -Phadoop-2.7 -Pkubernetes -Pkubernetes-hdfs-extra -pl core,resource-managers/kubernetes/core,resource-managers/kubernetes/token-refresh-server -am -Dtest=none '-Dsuffixes=^org.apache.spark.(?!SortShuffleSuite$|rdd.LocalCheckpointSuite$|deploy.SparkSubmitSuite$|deploy.StandaloneDynamicAllocationSuite$).*'

We should add this only after #453 is merged, because Maven fails the build when a project named in the `-pl` parameter of the unit test command, here `resource-managers/kubernetes/token-refresh-server`, does not exist. (Missing profiles, on the other hand, are simply ignored.)
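To illustrate the `-pl` concern, here is a minimal sketch of how a CI wrapper script could avoid listing the module before it exists. The `module_list` function is hypothetical and not part of the current Jenkins setup. It assumes the module, once merged, has a `pom.xml` at `resource-managers/kubernetes/token-refresh-server`.

```shell
#!/bin/sh
# Hypothetical helper (not in the existing CI scripts): prints the value
# for Maven's -pl option used by the unit test build.
# $1 is the root of the Spark checkout; defaults to the current directory.
module_list() {
  root="${1:-.}"
  modules="core,resource-managers/kubernetes/core"
  extra="resource-managers/kubernetes/token-refresh-server"
  # Assumption: the module ships a pom.xml once #453 is merged.
  # Only list it when that file is present, so -pl never names a
  # project that Maven cannot find.
  if [ -f "$root/$extra/pom.xml" ]; then
    modules="$modules,$extra"
  fi
  printf '%s\n' "$modules"
}
```

The unit test command could then pass `-pl "$(module_list)"` in place of the hard-coded list. Because missing profiles are ignored, `-Pkubernetes-hdfs-extra` itself would not need a similar guard.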

Please feel free to ask questions or make suggestions. (This was discussed in the SIG meeting this morning.)