open-metadata / OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
https://open-metadata.org
Apache License 2.0
5.49k stars 1.04k forks source link

dbt Enhancements #15090

Open OnkarVO7 opened 9 months ago

OnkarVO7 commented 9 months ago

[1.3.2] - JSON Schema & Parsing Improvements

[1.4] - Tags & Glossaries

We currently sync dbt tags into OM by creating a new classification DBTTags and adding all that info as tags inside. What we need to figure out here is a way to directly link dbt tags into existing tags/tiers/glossaries in OpenMetadata. Example:

tables:
- name: DATA_TABLE
  description: Data ,
  columns:
  - name: gross_revenue
    description: column description
    meta:
      openmetadata:
      # DO NOT create anything new in OM, just link to existing items
        - type: GlossaryTerm
          name: BusinessGlossary.GrossRevenue
        - type: Classification
          name: Tier.Tier1

[1.4.1] - dbt run details

We need to figure out how to link datamodels with relevant information like:

We have 2 different topics here:

Create this as a Pipeline, show status, link the last status in the table, use Incident manager to track these pipeline status

[1.6] - Semantic Layer

How to integrate GENERALLY "Semantic Layer" data, be it from dbt metrics/exposures, Tableau Metrics, etc.

[1.6] - dbt Hooks

[1.3.2] - Documentation

Backlog

harshach commented 2 months ago

@OnkarVO7 lets assign this to @harshsoni2024 and complete anything remaining in dbt