Azure-Samples / modern-data-warehouse-dataops

DataOps for Microsoft Data Platform technologies. https://aka.ms/dataops-repo
MIT License
588 stars, 459 forks

E2E/MDW Governance - De-Identify dataset before move to Bronze w. Presidio #319

Closed (balteravishay closed this 3 years ago)

balteravishay commented 3 years ago

Type of PR

Purpose

This PR adds a stage to the modern-data-warehouse governance sample that anonymizes the description column of the dataset using Presidio. The stage runs on Azure Databricks, before the dataset is saved to the Bronze container.
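
For readers unfamiliar with the pattern, here is a minimal sketch of de-identifying a description column before the data is persisted. It is not the code in this PR: to stay self-contained it swaps in two hypothetical regex recognizers where the real stage would call Presidio, and it works on plain Python dicts instead of a Databricks DataFrame. The names `PATTERNS`, `anonymize_text`, `anonymize_column`, and the sample rows are all assumptions made for illustration.

```python
import re

# Hypothetical stand-in for PII detection: two simple regex recognizers.
# The stage described above would use Presidio for this step instead.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE_NUMBER": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def anonymize_text(text):
    """Replace each detected entity with a <ENTITY_TYPE> placeholder."""
    if text is None:
        return None  # leave missing values untouched
    for entity, pattern in PATTERNS.items():
        text = pattern.sub(f"<{entity}>", text)
    return text


def anonymize_column(rows, column="description"):
    """Return new rows with `column` de-identified; input rows are not mutated."""
    return [{**row, column: anonymize_text(row.get(column))} for row in rows]


# Illustrative sample data (not taken from the sample's dataset).
rows = [
    {"id": 1, "description": "Call Dana at 555-123-4567 or dana@example.com"},
    {"id": 2, "description": None},
]

# Only the de-identified rows would be written to the Bronze container.
bronze_ready = anonymize_column(rows)
print(bronze_ready[0]["description"])
```

Running the transformation before the write means raw PII never lands in Bronze. In a Spark job the same per-value function would typically be wrapped as a UDF and applied to the column, though that wiring is omitted here.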

Does this introduce a breaking change? If yes, details on what can break

Author pre-publish checklist

balteravishay commented 3 years ago

@devlace PR comments are resolved.