databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
https://databrickslabs.github.io/dbldatagen
Other
291 stars 57 forks source link

Feature distribution changes - migrated tests to Pytest, use of abstract base classes #277

Closed ronanstokes-db closed 1 month ago

ronanstokes-db commented 1 month ago

Proposed changes

Leverage abstract base classes in keeping with implementation of constraints, migrated tests to Pytest

Types of changes

What types of changes does your code introduce to dbldatagen? Put an x in the boxes that apply

Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.

Further comments

Use of abstract base classes as general pattern for any pluggable aspects

codecov[bot] commented 1 month ago

Codecov Report

Attention: Patch coverage is 83.33333% with 1 lines in your changes are missing coverage. Please review.

Project coverage is 92.59%. Comparing base (82ce5ce) to head (edc57c6).

Files Patch % Lines
dbldatagen/distributions/data_distribution.py 83.33% 1 Missing :warning:
Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #277 +/- ## ========================================== - Coverage 92.62% 92.59% -0.04% ========================================== Files 33 33 Lines 2998 2996 -2 Branches 523 522 -1 ========================================== - Hits 2777 2774 -3 - Misses 131 132 +1 Partials 90 90 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.