Closed Volodymyr128 closed 6 years ago
Also I did job submition request from terminal - the same result:
curl -d '{"samples": 10000}' -H 'Content-Type: application/json' -X POST http://localhost:2004/v2/api/functions/volodymyr.bakhmatiuk_hello-mist-java/jobs?context=volodymyr.bakhmatiuk_cluster_context
{"id":"a3dbd90c-3ca7-4bc7-910f-5c3c5901fb28"}
Thanks for the detailed description! I've just released v1.0.0-RC9 witch include fix for such problem (#411 )
Thank you for quick response! Now I get another issue - my jobs do not get to remote cluster:
18-02-09 23:01:08 [mist-akka.actor.default-dispatcher-18] INFO ere.mist.master.WorkersManager:107 Trying to start worker volodymyr.bakhmatiuk_cluster_context, for context: volodymyr.bakhmatiuk_cluster_context
18-02-09 23:03:08 [mist-akka.actor.default-dispatcher-17] WARN ere.mist.master.WorkersManager:131 Worker volodymyr.bakhmatiuk_cluster_context initialization timeout: not being responsive for 2 minutes
18-02-09 23:03:08 [mist-akka.actor.default-dispatcher-17] INFO ere.mist.master.WorkersManager:107 Worker for volodymyr.bakhmatiuk_cluster_context is marked down
18-02-09 23:03:08 [mist-akka.actor.default-dispatcher-25] INFO ist.master.FrontendJobExecutor:107 Job RunJobRequest(6ea5caa8-cec1-47d7-bd44-725601137dd9,JobParams(volodymyr.bakhmatiuk_hello-mist-java_0.0.1.jar,HelloMist,Map(samples -> 10000),execute)) id done with result JobFailure(6ea5caa8-cec1-47d7-bd44-725601137dd9,Worker volodymyr.bakhmatiuk_cluster_context initialization timeout: not being responsive for 2 minutes)
18-02-09 23:06:16 [mist-akka.actor.default-dispatcher-25] WARN mote.PhiAccrualFailureDetector:131 heartbeat interval is growing too large: 2766 millis
Is there any way to get more detailed logs to find out what is wrong?
Try to run mist from binaries: there are network limitations with docker mode and all logs will be collected in $MIST_HOME/logs
Thank you! That solved my issue
Glad to hear that.
Despite function was deployed with default context =
volodymyr.bakhmatiuk_cluster_context
, it is launched with default local context. Help me please to launch my job on my remote cluster!To launch
HelloMist
function on my cluster, I did four steps due to documentation:context=cluster_context
inhello_mist/scala/conf/20_function.conf
mvn package
mist-cli apply -f conf
Now I can check that function's context is linked to my cluster:
And I can check that configurations has been deployed:
Now I launch job through WebMist and it is finished successfully. But it looks like WebMist launch a job on local[*] spark cluster, because nothing have been launched on
myhost.com
cluster! Logs:P.S. My Spark cluster version equals 2.1.1.
I launch mist this way:
docker run -p 2004:2004 -v /var/run/docker.sock:/var/run/docker.sock hydrosphere/mist:1.0.0-RC8-2.2.0 mist