Closed (ayanbizz closed this issue 3 years ago)
This error means the agent is not able to reach the Gateway over the network. If Spline works in local mode, the connection from the cluster to the Gateway is probably blocked somewhere.
The agent is basically a listener in the Spark driver, and since in cluster mode the driver runs in the cluster, you have to make sure that the Spline Gateway is accessible from the cluster and that the spark.spline.producer.url works when accessed from the cluster.
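One way to check reachability from a cluster node is a plain TCP connect to the host and port of the producer URL. The sketch below is only an illustration: the class name GatewayReachability and its helper methods are hypothetical and not part of Spline. It parses the address with java.net.URI and opens a raw socket, so it does not depend on whichever URL stream handler is installed in the JVM.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.URI;

// Hypothetical helper, not part of Spline: checks whether a TCP connection
// to the host/port of the producer URL can be opened from this machine.
class GatewayReachability {

    // Use the explicit port if the URL has one, otherwise the scheme default.
    static int effectivePort(URI uri) {
        if (uri.getPort() != -1) {
            return uri.getPort();
        }
        return "https".equalsIgnoreCase(uri.getScheme()) ? 443 : 80;
    }

    // Returns true if a TCP connection succeeds within the timeout.
    // Unknown hosts, refused connections and timeouts all yield false.
    static boolean canConnect(String producerUrl, int timeoutMillis) {
        URI uri = URI.create(producerUrl);
        try (Socket socket = new Socket()) {
            socket.connect(
                    new InetSocketAddress(uri.getHost(), effectivePort(uri)),
                    timeoutMillis);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        if (args.length != 1) {
            System.err.println("usage: GatewayReachability <producer-url>");
            return;
        }
        // Pass the same value that is configured as spark.spline.producer.url.
        System.out.println(canConnect(args[0], 3000) ? "reachable" : "NOT reachable");
    }
}
```

A successful connect only shows that the port is open from that node. It does not prove that the Gateway answers HTTP requests correctly, so treat it as a first check rather than a full diagnosis.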
It looks more like a Scalaj-Http or Hadoop issue, or a mix of both. Look at this line:
Caused by: scala.MatchError: org.apache.hadoop.fs.FsUrlConnection:http://someurl/producer/status (of class org.apache.hadoop.fs.FsUrlConnection)
An FsUrlConnection is used in a place where an HttpURLConnection is expected.
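The mechanism can be reproduced without Hadoop. The sketch below is an illustration under assumptions, not the actual Hadoop or Scalaj-Http code: WrappedConnection is a hypothetical stand-in for FsUrlConnection, and the handler is passed directly to the URL constructor instead of being registered JVM-wide. It shows that once a custom stream handler answers http URLs, openConnection() no longer returns an HttpURLConnection, so any caller that only handles that type fails.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLConnection;
import java.net.URLStreamHandler;

class ConnectionTypeDemo {

    // Hypothetical stand-in for Hadoop's FsUrlConnection: a URLConnection
    // that is NOT a subclass of HttpURLConnection.
    static final class WrappedConnection extends URLConnection {
        WrappedConnection(URL url) {
            super(url);
        }

        @Override
        public void connect() {
            // No network access is needed for this sketch.
        }
    }

    // Handler that answers every URL it is given with the wrapper type.
    static final URLStreamHandler WRAPPING_HANDLER = new URLStreamHandler() {
        @Override
        protected URLConnection openConnection(URL url) {
            return new WrappedConnection(url);
        }
    };

    // Opens (but does not connect) a connection and reports its kind.
    // A null handler means "use the JVM's default handler for the protocol".
    static String connectionKind(String spec, URLStreamHandler handler) {
        try {
            URLConnection conn = new URL(null, spec, handler).openConnection();
            return (conn instanceof HttpURLConnection)
                    ? "http"
                    : "unexpected: " + conn.getClass().getSimpleName();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String spec = "http://someurl/producer/status";
        System.out.println(connectionKind(spec, null));             // default handler
        System.out.println(connectionKind(spec, WRAPPING_HANDLER)); // wrapper type
    }
}
```

In the failing job the handler is presumably installed globally rather than per URL, which is why library code that never mentions Hadoop still receives the wrapper type.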
A similar issue was reported in Spark - https://issues.apache.org/jira/browse/SPARK-25694
The issue should be fixed in Spark 3.0.0, and was supposedly also backported to Spark 2.4 (see https://github.com/apache/spark/pull/26530#issuecomment-630493684).
@ayanbizz Can you try it with Spark 2.4?
I can confirm that with Spark 2.4.5 we did not have such problems :)
Hello, does anyone have a workaround for this besides updating Spark? @wajda I'm trying to see if we can work around this on the Spline agent side so we don't have to upgrade all apps to Spark 2.4.5+. Thanks!
I would try replacing scalaj with another HTTP client in the HttpLineageDispatcher class.
Thank you @wajda, we will give that a try! We will look into other options for HTTP clients and hope to get it working!
I am getting the following error while trying to initialize Spline in YARN cluster mode. The error is related to access to the producer URL. I am able to run Spline with deploy-mode = client; the issue occurs only in cluster mode. This is how I tried to run the spark-submit