deordie/deordie-digest
Data Engineering Digest
https://digest.deordie.org
Creative Commons Attribution 4.0 International
27 stars
2 forks
issues
Newest
InfoQ Software Architecture and Design Trends Report - April 2024 / InfoQ
#202
k-tomak
closed
4 months ago
1
Course Events and Event Streaming / Adam Bellemare @ Confluent Developer
#201
k-tomak
closed
4 months ago
1
Accelerating and Scaling dbt for the Enterprise / Dakota Kelley @ phData blog
#200
k-tomak
closed
4 months ago
1
Data Quality Score: How We Evolved the Data Quality Strategy at Airbnb / Clark Wright @ Netflix Data Engineering Open Forum 2024
#199
k-tomak
closed
4 months ago
1
Simplify PySpark testing with DataFrame equality functions
#198
Ceridan
closed
4 months ago
1
Data Domains — Where do I start? / Piethein Strengholt
#197
k-tomak
closed
8 months ago
1
Announcing Observable 2.0
#196
Ceridan
opened
9 months ago
0
How we built our customer data warehouse all on Postgres | Tembo
#195
Ceridan
closed
8 months ago
1
The 2023 edition of the Machine Learning, AI and Data Landscape — a quick analysis / Oliver Molander
#194
Ceridan
opened
10 months ago
0
Python 3.13 gets a JIT
#193
Ceridan
closed
8 months ago
1
How Meta built the infrastructure for Threads
#192
Ceridan
closed
8 months ago
1
The Ultimate List of Best Software Architecture Books (2024) / Patric Roos
#190
k-tomak
closed
4 months ago
0
Data Quality Score: The next chapter of data quality at Airbnb / Airbnb blog
#189
k-tomak
closed
11 months ago
1
Streaming SQL in Data Mesh / Netflix Blog
#188
k-tomak
closed
11 months ago
1
Metastable failures in the wild / Murat Demirbas @ Metadata Blog
#187
k-tomak
opened
1 year ago
0
Seven Principles of Cloud-Native Architecture / Alibaba Cloud Native Community Blog
#186
k-tomak
closed
11 months ago
1
Why Kafka Is the New Data Lake? / RisingWave Labs
#185
k-tomak
opened
1 year ago
0
The problems in the Modern Data Stack / Diogo Silva Santos
#184
k-tomak
opened
1 year ago
0
Semantic Layer: Future of Self-Serve Analytics / Seckin Dinc
#183
k-tomak
opened
1 year ago
0
The Zoo of Consistency Models / The Educative Team
#182
k-tomak
closed
11 months ago
1
Is Kimball Still Relevant? / Joe Reis Blog
#181
k-tomak
closed
11 months ago
1
Seamlessly Migrate Your Apache Parquet Data Lake to Delta Lake / Dipankar Kushari, Uday Satapathy @ Databricks Engineering Blog
#180
Ceridan
closed
8 months ago
1
Why Uber Engineering Switched from Postgres to MySQL / Uber Engineering Blog
#179
Ceridan
closed
1 year ago
1
Dynamic Filtering: a Critical Performance Optimization in Analytical Engines / Vladimir Ozerov @ Querify Labs Blog
#178
k-tomak
closed
1 year ago
1
The State of Data Engineering 2023 / Einat Orr @ lakeFS blog
#177
k-tomak
closed
1 year ago
1
Privacy Enhancing Technologies: An Introduction for Technologists / Katharine Jarmul
#176
Ceridan
closed
1 year ago
1
Data Mesh in practice: Product thinking and development (Part III) / Ammara Gafoor, Ian Murdoch, Kiran Prakash @ Thoughtworks Blog
#175
k-tomak
closed
1 year ago
1
Empowering Azure Storage with RDMA / Murat Demirbas
#174
k-tomak
opened
1 year ago
0
5 Helpful Extract & Load Practices for High-Quality Raw Data / Sven Balnojan
#173
k-tomak
closed
1 year ago
1
Spark Connect Available in Apache Spark 3.4 / Databricks Blog
#172
k-tomak
closed
1 year ago
1
Spark SQL Query Engine Deep Dive Parts I and II – Adaptive Query Execution / Linxiao Ma
#171
k-tomak
closed
1 year ago
1
Following a database read to the metal / Hussein Nasser
#170
k-tomak
closed
1 year ago
1
Your Data Catalog Shouldn’t Be Just One More UI / Mahdi Karabiben
#169
k-tomak
closed
1 year ago
1
Functional Data Engineering — a modern paradigm for batch data processing / Maxime Beauchemin
#168
k-tomak
closed
1 year ago
1
Using Metrics Layer to Standardize and Scale Experimentation at DoorDash / Arun Balasubramani @ DoorDash engineering blog
#167
k-tomak
closed
1 year ago
1
The end of a myth: Distributed transactions can scale / Murat Demirbas Blog
#166
k-tomak
closed
1 year ago
1
Software Architecture and Design InfoQ Trends Report - April 2023 / InfoQ
#165
k-tomak
closed
1 year ago
1
Introducing Entity-Centric Data Modeling for Analytics / Maxime Beauchemin @ Preset Blog
#164
k-tomak
closed
1 year ago
1
Pushdown / Trino Query optimizer docs
#163
k-tomak
closed
1 year ago
1
The Question That Every Data Engineer Should Ask / Xinran Waibel @ Data Engineer Things Blog
#162
k-tomak
closed
1 year ago
1
Apache Spark — Job monitoring / Hareesha Dandamudi
#161
k-tomak
closed
1 year ago
1
Setting Uber’s Transactional Data Lake in Motion with Incremental ETL Using Apache Hudi / Uber
#160
DashaBulanova
closed
1 year ago
1
Concurrency is not Parallelism by Rob Pike
#159
DashaBulanova
closed
1 year ago
1
The Data Engineer’s Roadmap / James Phoenix
#158
k-tomak
closed
1 year ago
1
Essential Snowflake Cost Reduction Strategies / Niall Woodward @ select.dev Blog
#157
k-tomak
closed
1 year ago
1
The Chaos Data-Engineering Manifesto / Shane Murray @ Towards Data Science Blog
#156
k-tomak
closed
1 year ago
1
Distinct aggregation optimization in Apache Calcite and Trino / Querify Labs Blog
#155
k-tomak
closed
1 year ago
1
Solving Advent of Code with DuckDB and DBT / Graham Wetzler
#154
k-tomak
closed
1 year ago
1
BIG DATA IS DEAD / Jordan Tigani
#153
k-tomak
closed
1 year ago
1
5 Ways to Use Column Level Data Lineage / Montecarlo Data Blog
#152
k-tomak
closed
1 year ago
1