Closed mgummelt closed 8 years ago
LGTM, only minor issues. I have run it locally runs fine.
Note: However, I am not so sure about the role tests. My opinion is that in local mode we can implicitly deduce if role resources are utilized although surprisingly mesos lacks of an api to ask for which type of resources are offered to and used by a framework any given time. Moreover, this info does exist in some form in the master log where you can see what resources were allocated.
Anyway, if everyone is ok we can fall to back to the simple assert if a role set exists in spark. Maybe in the future we could revise this.
@mgummelt A bit of context here: https://issues.apache.org/jira/browse/SPARK-10749. This was one of the motivators for this effort /cc @tnachen
The idea was to have an integration test for this scenario. If it is impossible, we can skip it and assume that the unit test will catch it.
It is true that we're assuming a controlled Mesos cluster, and running it in a multi-tenant setting is something we didn't consider. Happy to skip such tests in the DCOS suite, I'm just wondering if there's value in keeping it for the single-user case (docker).
This won't actually test SPARK-10749 directly, since this is testing the driver, and SPARK-10749 was a bug with the dispatcher, but I see your point. We want to test that a driver will correctly use an offer for a given role.
I have a few thoughts about this. I'll send an email.
@skonto ready for a review
ok im reviewing it... sorry for the delay...
now im back on it....
@mgummelt looks fine locally... could you check my comments above? @skyluc @dragos any comments?
@skonto I'm using call-by-value now. All good?
b.t.w I forked the master branch into our own custom version so we have more control over when tests get fixed. We're not planning on diverging, we just need the ability to merge hotfixes in ASAP.
I understand that but proper reviews need time...im not sure if that will work for you while somehow not diverging because the order that prs are merged will be different, prs maybe rejected or changed... am i wrong?
LGTM.
One nitpicking commend, otherwise LGTM.
@skonto we'll still be merging in all upstream changes. If there's a PR we need to fix tests, but it doesn't get accepted upstream, then we'll have to figure out what to do with it on a case-by-case basis. If it's absolutely needed, they there will be a slight divergence, yes. But that should be the exception. We're still going to be submitting all our PRs against master and contributing upstream.
cc @skonto