Forem

# dataengineering

Posts

Apache Gravitino Introduction

2
Comments
5 min read
S3-Native Kafka Alternatives: What's Actually Different

Comments
3 min read
Day 12: UDF vs Pandas UDF

Comments
2 min read
The Data Engineers Descent Into Datetime Hell

1
Comments
5 min read
Day 11: Choosing the Right File Format in Spark

Comments
2 min read
Navigating the Future: Key Data Engineering Trends for 2024 and Beyond

Comments
6 min read
How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)

Comments
7 min read
Day 7: Mastering Joins, Unions, and GroupBy in PySpark - The Core ETL Operations

Comments
2 min read
The evolution of the Modern Data Stack: From RDBMS to the LakeHouse

1
Comments
11 min read
map

Comments
1 min read
A Lightweight, Plugin-Oriented ETL Engine for Data Synchronization Built on Akka.NET

Comments
4 min read
Data Engineering Isn’t About Tools — It’s About Thinking Like This

1
Comments
2 min read
Data Engineering in 30 Days - Day 2

Comments
2 min read
Why Frontend Teams Should Care About Data Modeling for Real-Time Dashboards

Comments
2 min read
Refactoring a Mature Airflow Project: A Practical Guide to Scaling from Solo Development to Team Collaboration

Comments
4 min read