Azure-Samples / modern-data-warehouse-dataops

DataOps for Microsoft Data Platform technologies. https://aka.ms/dataops-repo
MIT License
591 stars 463 forks source link
automatedtesting azure cicd data databricks datafactory dataops devops fabric

page_type: sample languages:

DataOps

This repository contains numerous code samples and artifacts on how to apply DevOps principles to common data engineering patterns and architectures utilizing Microsoft data platform technologies.

The samples are either focused on a single microsoft service (Single-Technology Samples) or showcases an end-to-end data pipeline solution as a reference implementation (End-to-End Samples). Each sample contains code and artifacts related to one or more of the following capabilities:

In addition to the samples, this repository also contains Utilities. These are simple scripts or code snippets that can be used as-is or as a starting point for more complex automation tasks.

Single-Technology Samples

Technology Samples
Microsoft Fabric ▪️ CI/CD for Microsoft Fabric
▪️ Feature engineering on Microsoft Fabric
Azure SQL database ▪️ CI/CD for Azure SQL database
Azure Databricks ▪️ CI/CD for Azure Databricks
Azure Data Factory ▪️ CI/CD for ADF with Auto publish
▪️ Data pre-processing using Azure Batch
Azure Stream Analytics ▪️ CI/CD for Azure Stream Analytics

End-to-End Samples

DataOps for Medallion with Azure Data Factory and Azure Databricks

This sample demonstrates batch, end-to-end data pipeline utilizing Azure Data Factory and Azure Databricks built according to the medallion architecture, along with a corresponding CI/CD process, observability and automated testing.

Medallion with Azure Data Factory and Azure Databricks

DataOps for Medallion with Microsoft Fabric

Utilities

Technology Utility Description
Microsoft Fabric ▪️ Script to upload file in GIT repo to Fabric lakehouse

Contributing

This project welcomes contributions and suggestions. Please see our Contributing guide.