datastackacademy / deb-archive

TuraLabs FREE Data Engineering Bootcamp
Apache License 2.0

TuraLabs Data Engineering Bootcamp

Learn to deploy end-to-end Serverless Data Engineering Pipelines on GCP via the most comprehensive and FREE online course.

Description

This repo contains the code for the TuraLabs Data Engineering Bootcamp (DEB), including the code scaffolds and starting data referenced in the DEB lessons.

QuickLinks

Course Prerequisites

This course starts with intermediate to advanced topics right away. We therefore strongly recommend that learners have some experience with Python and SQL. For a more in-depth explanation of the expected prerequisites and a list of resources to bring you up to speed, please visit our blog post on Helpful Resources to Prep for this Course.

Setting Up Your Dev Environment

The DEB course uses Python and Google Cloud Platform (GCP) tools. Please follow the instructions in our Getting Started Guide to make sure your dev environment is properly set up and compatible with the course. If you have any issues getting your dev environment up and running, please visit our Discord Channel to talk to us.

Need Help?

We're here to help! If you have any questions, please connect with us on our Discord Channel and one of us will be happy to help you.

Suggestions

If you have any suggestions for the course or website, please feel free to open a GitHub Issue within this repo. We also welcome suggestions in the suggestion channel on our Discord Server.

Patch Notes

20210121 New Chapter 3 Episode 4 Added

Chapter 3

20210108 2 New Chapter 3 Episodes Added + Small Fixes

Chapter 1

Chapter 3

20201204 New Chapter 3 Episode + Small Fixes

Chapter 1

-Fixed typos in chapter 1 overview (thank you Senad)

Chapter 2

-Added GCS source file download instructions to chapter 2 episode 2 (thank you Jason)

-Fixed API registration link in chapter 2 episode 4

-Updated Portman API documentation links in chapter 2 episode 4

-Enhanced chapter 2 episode 5 webapp

Chapter 3

Blog

-Added Spark Explained blog post

20201117 New Blog Post + Small Fixes

General

Website

Blog

20201021 Ch1 Update + Small Fixes

General

Ch1

-Added new Ch1Ep5 lesson on advanced pandas usage, which updates the aircraft dataset to the latest FAA records

20200930 Ch2 Update + Small Fixes

Website

General

Ch1

Ch 2

Blog

DEB Repo

Ch 2

20200923 Small Fixes

General

Chapter 1

-Changed paths in code examples to reflect the location of the data in the provided repo