databrickslabs / dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
https://databrickslabs.github.io/dbldatagen
Other
291 stars 57 forks source link

Feature standard datasets - part 2 #286

Closed ghanse closed 1 month ago

ghanse commented 1 month ago

Proposed changes

Added the following standard datasets:

Types of changes

What types of changes does your code introduce to dbldatagen? Put an x in the boxes that apply

Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.

Further comments

N/A

codecov[bot] commented 1 month ago

Codecov Report

Attention: Patch coverage is 88.32487% with 23 lines in your changes missing coverage. Please review.

Project coverage is 92.39%. Comparing base (da1df6b) to head (7c77e2a).

:exclamation: Current head 7c77e2a differs from pull request most recent head e7a6216

Please upload reports for the commit e7a6216 to get more accurate results.

Files Patch % Lines
dbldatagen/datasets/basic_telematics.py 81.25% 8 Missing and 4 partials :warning:
dbldatagen/datasets/basic_process_historian.py 82.92% 6 Missing and 1 partial :warning:
dbldatagen/datasets/basic_geometries.py 95.91% 1 Missing and 1 partial :warning:
dbldatagen/datasets/benchmark_groupby.py 94.87% 2 Missing :warning:
Additional details and impacted files ```diff @@ Coverage Diff @@ ## master #286 +/- ## ========================================== - Coverage 92.63% 92.39% -0.24% ========================================== Files 38 42 +4 Lines 3380 3577 +197 Branches 589 631 +42 ========================================== + Hits 3131 3305 +174 - Misses 141 158 +17 - Partials 108 114 +6 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.