Fuenfgeld / 2022TeamADataEngineeringBC

This is a repository for a Data Engineering Tutorial
MIT License
0 stars 2 forks source link

ETL Process

Abstract

This repository contains the an ETL Process tutorial using Apache Spark and the DataVault concept. The tutorial will give you a short introduction to data engineering and and the ETL Process. You will learn how to extract data from various sources, transform them into a format suitable to analysis and load the data into your target database.

You will need about one hour for the tutorial. The tutorial contains presentations to explain all materials as well as exercises to practice the concepts. For futher reading written explainations of all concepts also exist.

Table of contents: