ErdemOzgen / Data-Engineering-Roadmap

Roadmap for Data Engineering

Disclaimer

The purpose of this roadmap is to give you an idea of the landscape. It will guide you if you are unsure what to learn next, rather than encouraging you to pick whatever is hyped and trendy. You should develop an understanding of why one tool is better suited to some cases than another, and remember that hyped and trendy does not always mean best suited for the job.

Give a Star! :star:

If you like or are using this project to learn or start your solution, please give it a star. Thanks!

Roadmap

Programming Languages

Learn Linux

There are two main parts to learning Linux: System Administration and Shell Scripting. You can adjust how deeply you study each according to your preference.

Data Structures and Algorithms / System Design

SQL

There are a number of good introductory SQL resources available online for free. There are also some paid resources that I recommend for beginners; they are very effective and, in my opinion, well worth expensing. A couple of notes:

Testing

CI/CD and Virtualization

Database Fundamentals

Relational Databases

Non-Relational Databases

Data Processing

Messaging

Cluster Computing Fundamentals

Object Storage

Data Warehouses

Monitoring Data Pipelines

Data Visualization

Machine Learning and Deep Learning Tools

MLOps Tools

Cloud

Wrap Up

If you think the roadmap can be improved, please open a PR with your updates or submit an issue. I will also continue to improve it, so you may want to star this repository and revisit it. Idea from: Python Developer Roadmap

Contribution

The roadmap is built using Draw.io. The project file is DataEngRoadmap.xml. To modify it, open draw.io, click Open Existing Diagram, and choose the project's XML file. It will open the roadmap for you. Update it, then export it as a PNG with 400% zoom and minify the result with Compressor.io. Finally, upload and update the images in the readme and create a PR.