databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can be used to generate large simulated / synthetic data sets for testing, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
https://databrickslabs.github.io/dbldatagen

Feature generate from data revised #204

Open ronanstokes-db opened 1 year ago

ronanstokes-db commented 1 year ago

Proposed changes

Improves code generation from existing data, and improves the analysis of existing data to determine the possible categorical values it contains.

Also includes minor improvements to code generation.
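
To make the idea of determining categorical values concrete, here is a minimal sketch in plain Python. It is not the code in this PR: the helper names `infer_categorical_values` and `emit_column_spec` and the `max_distinct` threshold are hypothetical, chosen only for illustration. It assumes a column is treated as categorical when it has only a few distinct values, and it emits text that uses the `values` and `weights` options of dbldatagen's `withColumn`.

```python
from collections import Counter


def infer_categorical_values(rows, column, max_distinct=10):
    """Return (values, weights) if `column` looks categorical, else None.

    Hypothetical helper: a column counts as categorical when it has at
    most `max_distinct` distinct non-null values.
    """
    counts = Counter(
        row[column] for row in rows if row.get(column) is not None
    )
    if not counts or len(counts) > max_distinct:
        return None
    values = sorted(counts)                 # stable ordering for generated code
    weights = [counts[v] for v in values]   # observed frequency of each value
    return values, weights


def emit_column_spec(name, values, weights):
    """Render one generated-code line for a categorical string column."""
    return (
        f'.withColumn({name!r}, "string", '
        f"values={values!r}, weights={weights!r})"
    )


sample_rows = [
    {"id": i, "status": s}
    for i, s in enumerate(["a", "b", "a", "c", "a"])
]

status = infer_categorical_values(sample_rows, "status", max_distinct=3)
ids = infer_categorical_values(sample_rows, "id", max_distinct=3)

if status is not None:
    print(emit_column_spec("status", *status))
```

In this sketch `status` is detected as categorical with its observed frequencies carried over as weights, while `id` is rejected because it has more distinct values than the threshold allows.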

Further comments

This is part of ongoing enhancements to code generation from existing data.

codecov[bot] commented 1 year ago

Codecov Report

Attention: 49 lines in your changes are missing coverage. Please review.

Comparing base (1c8b340) at 92.19% with head (d6f14eb) at 91.17%. The report is 3 commits behind head on master.

| Files | Patch % | Lines |
|---|---|---|
| dbldatagen/data_analyzer.py | 81.22% | 40 Missing and 9 partials :warning: |
Additional details and impacted files

```diff
@@            Coverage Diff             @@
##           master     #204      +/-   ##
==========================================
- Coverage   92.19%   91.17%   -1.03%
==========================================
  Files          23       23
  Lines        2754     2968     +214
  Branches      471      512      +41
==========================================
+ Hits         2539     2706     +167
- Misses        128      168      +40
- Partials       87       94       +7
```
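
As a sanity check on the figures in the coverage diff above, the following sketch recomputes the base and head percentages. It assumes coverage is hits divided by total lines, where total lines is hits plus misses plus partials. The function names are hypothetical.

```python
def total_lines(hits, misses, partials):
    """Total tracked lines, assuming every line is a hit, a miss, or a partial."""
    return hits + misses + partials


def coverage_pct(hits, lines):
    """Coverage as a percentage, rounded to two decimals."""
    return round(100.0 * hits / lines, 2)


# Figures copied from the coverage diff.
base_lines = total_lines(hits=2539, misses=128, partials=87)   # 2754
head_lines = total_lines(hits=2706, misses=168, partials=94)   # 2968

base_cov = coverage_pct(2539, base_lines)   # 92.19
head_cov = coverage_pct(2706, head_lines)   # 91.17

print(base_lines, head_lines, base_cov, head_cov)
```

The line totals and both percentages match the report's `Lines` and `Coverage` rows.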

CLAassistant commented 9 months ago

CLA assistant check
All committers have signed the CLA.