Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Expected Behavior
DBLDATAGEN should be able to run with Databricks Serverless Instances as
spark.sql.execution.arrow.pyspark.enabled
is set by default.Current Behavior
Module fails as it checks for the value of the configuration.
Steps to Reproduce (for bugs)
Connect a notebook to a Databricks Serverless Instance. Import dbldatagen Try creating a spec with DataGenerator.
Context
N/A
Your Environment
Databricks Serverless Instance
dbldatagen
version used: 0.4.0