big-data-europe / docker-spark

Apache Spark docker image
2.04k stars 696 forks source link

How to connect via ODBC to Spark? #106

Open ghost opened 3 years ago

ghost commented 3 years ago

Expected Behavior I want to connect via Windwos ODBC to the docker-spark container.

Actual Behavior I receive an SSL error.

Steps to Reproduce the Problem

Version:

git clone https://github.com/big-data-europe/docker-spark
cd docker-spark
docker-compose up

Platform:

ghost commented 3 years ago

@GezimSejdiu are you the Windows/ODBC master here ;-)?

noahkawasakigoogle commented 3 years ago

I think you need to start up the Thrift Server to connect with ODBC? This particular image does not do that by default, I was trying to connect to Spark through JDBC from outside the container as well.

From within the master node:

cd /spark/bin && /spark/sbin/../bin/spark-class org.apache.spark.deploy.SparkSubmit --class org.apache.spark.sql.hive.thriftserver.HiveThriftServer2 spark-internal

After that, it might work