mitodl / ol-data-platform

Pipeline definitions for managing data flows to power analytics at MIT Open Learning
BSD 3-Clause "New" or "Revised" License
35 stars 6 forks source link

Evaluate metadata management services for integrating with the data platform #855

Open blarghmatey opened 9 months ago

blarghmatey commented 9 months ago

User Story

Description/Context

In order to provide a cohesive view of our data platform and the various data sets that are available we want to implement a cross-cutting metadata platform. This will provide visibility into the full lineage graph of a given data asset (e.g. database table, dashboard report, report export delivered via Dagster, etc.). There are numerous open source and commercial options available, so the purpose of this issue is to establish a set of evaluation criteria and select a solution that we would like to implement.

Acceptance Criteria

Plan/Design

Review relevant documentation and pricing information available for each platform. Perform a simple proof of concept implementation of the top contenders.

blarghmatey commented 8 months ago

Products to be considered: