dbt-labs / hubcap

This app adds modules to the hubsite at hub.getdbt.com
13 stars 100 forks source link

Add dbt_eda_tools #324

Closed shankararul closed 2 months ago

shankararul commented 2 months ago

Description

A medley of macros that could be handy for your data exploration in DBT. The first iteration implements Get Missing Dates

✅ Get Missing Dates Finds all the missing dates in a model for the specified dimensions and filters according to the time granularity expected

🚧 Fill Missing Dates (Coming soon) Fills the missing dates in a model for the specified dimensions and filters according to the time granularity expected

🚧 Show as Percentage (Coming soon) Shows the value as percentage of the total value for the specified aggregations

🚧 Exploratory data analysis (Coming soon)

Numeric column exploration Get summary statistics such as Min, Max, Median, Null values, Percentiles, Standard deviation, etc. for numeric columns

Categoric column exploration Get summary statistics such as Count, Unique values, Null values for categoric columns

Timeseries column exploration Get summary statistics such as Start date, End date, granularity of the timeseries (day,month,year), null values, missing dates for timeseries columns

Docs site: https://shankararul.github.io/dbt_eda_tools/#!/macro/macro.dbt_eda_tools.get_missing_date Link to your package's repository: https://github.com/shankararul/dbt_eda_tools

Checklist

This checklist is a cut down version of the best practices that we have identified as the package hub has grown. Although meeting these checklist items is not a prerequisite to being added to the Hub, we have found that packages which don't conform provide a worse user experience.

First run experience

Customisability

Dependencies

Dependencies on dbt Core

joellabes commented 2 months ago

@shankararul this is so cool! My only concern is the overlap of calling it dbt_utils_medley when it's not related to dbt Labs' dbt_utils package. If you tweaked the name to something like dbt_medley or dbt_eda_tools or similar, I'd love to get this merged 🎉

shankararul commented 2 months ago

Hey @joellabes thanks for your quick turnaround. Indeed you're right. In order to avoid confusion with dbt_utils, I renamed the package to dbt_eda_tools (thanks for finding the perfect name for this package😍).