databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for testing, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.
https://databrickslabs.github.io/dbldatagen

Uninstalling and reinstalling wheel on cluster running DBR 8.3 may fail #88

Closed ronanstokes-db closed 1 year ago

ronanstokes-db commented 2 years ago

If you have a named cluster specification in your Databricks environment and it had the current or a previous build of the data generator installed, uninstalling the library and then reinstalling it may fail.

Expected Behavior

Uninstall followed by reinstall should succeed.

Current Behavior

Uninstall followed by reinstall may fail.

Workaround

Our plan is to move to a pip-based install, which should make installation easier.

Your Environment

ronanstokes-db commented 2 years ago

You can also install the library in a Databricks notebook environment using the %pip command in a notebook cell. Installing with %pip does not exhibit the issue above.

To install the library via %pip, add a cell with the following content at the start of the notebook and execute it:

%pip install git+https://github.com/databrickslabs/dbldatagen
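After the %pip cell has run, a later cell can confirm that the package is visible to the notebook's Python environment. The following is a minimal sketch using only the standard library; it assumes the installed distribution is registered under the name `dbldatagen`, which may differ for older builds. The helper name `installed_version` is hypothetical and is not part of the library.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Assumption: the distribution name registered by the install is "dbldatagen".
version = installed_version("dbldatagen")
if version is None:
    print("dbldatagen is not installed in this environment")
else:
    print(f"dbldatagen {version} is installed")
```

If the check reports that the package is missing, rerunning the %pip cell at the start of the notebook is the first thing to try.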

ronanstokes-db commented 1 year ago

Newest versions will be based on Databricks Runtime 9.1 LTS or later.
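Given the 9.1 LTS minimum, a notebook could guard against running on an older runtime before installing. This is only a sketch: it assumes the cluster exposes its runtime version in the `DATABRICKS_RUNTIME_VERSION` environment variable in a `major.minor` form such as `9.1`, and the helper names `parse_runtime` and `meets_minimum` are hypothetical.

```python
import os
from typing import Optional, Tuple


def parse_runtime(version: str) -> Optional[Tuple[int, int]]:
    """Parse the leading 'major.minor' of a runtime string; None if it cannot be parsed."""
    parts = version.split(".")
    try:
        return int(parts[0]), int(parts[1])
    except (IndexError, ValueError):
        return None


def meets_minimum(version: str, minimum: Tuple[int, int] = (9, 1)) -> bool:
    """Return True if `version` parses and is at least `minimum`."""
    parsed = parse_runtime(version)
    return parsed is not None and parsed >= minimum


# Assumption: Databricks clusters set this variable; it is absent elsewhere.
runtime = os.environ.get("DATABRICKS_RUNTIME_VERSION", "")
if not runtime:
    print("Runtime version not found; this may not be a Databricks cluster")
elif meets_minimum(runtime):
    print(f"Runtime {runtime} meets the 9.1 minimum")
else:
    print(f"Runtime {runtime} is older than 9.1")
```

Comparing parsed integer tuples rather than raw strings avoids ordering `10.4` before `9.1`.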