Closed liyinan926 closed 6 years ago
This will be the follow-up to https://github.com/apache/spark/pull/19468
This seems like a large diff, but a quick scan shows everything included as necessary. We need the driver service bootstrap because of changes to master. I think we can reduce the fanciness of the credentials step but that doesn't reduce the complexity by a significant amount.
One TODO: Add the unit test in https://github.com/apache-spark-on-k8s/spark/pull/542 to this PR
Changed from #542 merged in.
A few comments but otherwise this captures the spirit of what we want to have upstream.
If there's no objection, I will squash the commits and push to upstream for review by EOD today. @apache-spark-on-k8s/contributors
SGTM! We should see how we can make it less confusing for reviewers - because this PR encompasses changes in spark-kubernetes-3.
When pushing upstream, I'm gonna remove code for the first PR so this is less confusing.
The latest commit seems to address my comments so far. Thanks!
Squashed the commit and removed scheduler backend code and relevant changes in Yarn-related code.
@kimoonkim any more comments on the submission steps?
This is under review at https://github.com/apache/spark/pull/19717.
@ifilonenko @erikerlandson @mccheah Can you please help with integration tests here? We need signals on what's working and what's not working. We expect some integration tests to fail, but most should pass. @liyinan926 just informed me that we were missing some critical functionality with local://
file support. We want to ensure issues like that don't creep into our upstream PR.
cc @kimoonkim
To elaborate, we are missing DependencyResolutionStep
in this PR. Without it, users won't even be able to use local://
dependencies.
we may be having a problem here if the docker poms were elided - the integration testing can't build images to run its tests, IIUC
Tracking in https://github.com/apache-spark-on-k8s/spark/issues/568. @erikerlandson, you're right. We need to separate out the integration testing now, so we can actually run it against any distribution - irrespective of whether it has the dockerfile target or not.
It seems that the LocalDirectoryMountConfigurationStep
is also essential. @mccheah @foxish @kimoonkim @erikerlandson @ifilonenko to confirm. If yes, I will add it to this PR.
The PR has been merged upstream. Closing this.
Second draft upstreaming PR that contains the basic submission client implementation and unit tests. Branch
spark-kubernetes-3-updated
is a clone ofspark-kubernetes-3
with latest changes fromupstream/master
merged in.spark-kubernetes-4
includes all our changes inspark-kubernetes-3
.cc @foxish @mccheah @apache-spark-on-k8s/contributors