bartosz25 / data-engineering-design-patterns-book

Code snippets for Data Engineering Design Patterns book
https://learning.oreilly.com/library/view/data-engineering-design/9781098165826/
33 stars 12 forks source link

Data Engineering Design Patterns

Code snippets

Welcome to the Github repository of the Data Engineering Design Patterns. The organization follows the book chapters.

Each pattern has a dedicated README.md that explains how to set it up on your local machine. The requirements for running the snippets are:

Disclaimer: although the examples tend to be the most realistic possible, their goal is to focus on one problem at a time. For that reason they'll often be simplified version of the code you should deploy on production. For that reason you might find hardcoded credentials or batch pipelines exposing the processed data without any data quality guards. Hopefully, thanks to the patterns presented in the book you'll be able to identify and apply all the best practices to your workloads.