microsoft / azuredatastudio

Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases. Azure Data Studio supports Windows, macOS, and Linux, with immediate capability to connect to Azure SQL and SQL Server. Browse the extension library for more database support options including MySQL, PostgreSQL, and MongoDB.
https://learn.microsoft.com/sql/azure-data-studio
MIT License

ADS doesn't find Python libraries correctly when running Pyspark kernel on cluster #6237

Closed sfweller closed 5 years ago

sfweller commented 5 years ago

Steps to Reproduce:

  1. Install the 'bokeh' module using 'python -m pip install bokeh' both on the local machine running ADS and on the aris cluster you are connected to.

  2. Open a new notebook and select 'Pyspark3' as the kernel and connect to an aris cluster.

  3. At the beginning of the notebook add the line 'import bokeh'.

  4. Attempt to run the notebook. You will get the following error message:

    No module named 'bokeh' ImportError: No module named 'bokeh'

  5. If you change the kernel to 'Python3', the notebook runs successfully.
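
One way to narrow this down is to check which interpreter actually runs the cell under each kernel, and whether that interpreter can see the module. The snippet below is a minimal diagnostic sketch, not part of the original report: `describe_module` is a hypothetical helper name, and it relies only on the standard library (`importlib.util.find_spec` and `sys.executable`). Under 'Python3' it should report the local interpreter; under 'Pyspark3' it presumably reports an interpreter on the cluster side, which is an assumption about how that kernel dispatches code.

```python
import importlib.util
import sys


def describe_module(name):
    """Report the running interpreter and whether `name` is importable in it.

    `describe_module` is a hypothetical helper for illustration only.
    `name` is expected to be a top-level module name such as "bokeh".
    """
    # find_spec returns None when a top-level module cannot be located.
    spec = importlib.util.find_spec(name)
    return {
        "interpreter": sys.executable,              # path of the Python running this cell
        "module": name,
        "found": spec is not None,
        "location": getattr(spec, "origin", None),  # where the module was found, if at all
    }


# Run this cell once per kernel and compare the two results.
print(describe_module("bokeh"))
```

If the two kernels report different interpreter paths, the 'pip install' from step 1 most likely went into a different environment than the one the Pyspark3 cell runs in.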

kevcunnane commented 5 years ago

I'm closing this as by design. Pip is for local package management. Remote package management isn't supported in the cluster yet. Once it is, we can start adding support for it in the UI. Thanks!