Forem

# dataengineering

Posts

Day 12: UDF vs Pandas UDF
2 min read
Building a Near Real-Time Analytics Pipeline with AWS Zero-ETL
4 min read
The Data Engineers Descent Into Datetime Hell
5 min read
Day 11: Choosing the Right File Format in Spark
2 min read
Navigating the Future: Key Data Engineering Trends for 2024 and Beyond
6 min read
How to Build Presto from Source - OSS Contribution Guide (Step by Step Tutorial)
7 min read
Day 7: Mastering Joins, Unions, and GroupBy in PySpark - The Core ETL Operations
2 min read
The evolution of the Modern Data Stack: From RDBMS to the LakeHouse
11 min read
map
1 min read
A Lightweight, Plugin-Oriented ETL Engine for Data Synchronization Built on Akka.NET
4 min read
Data Engineering Isn’t About Tools — It’s About Thinking Like This

2 min read
Weather Service Project (Part 1): Building the Data Collector with Python and GitHub Actions or Netlify
10 min read
Why Frontend Teams Should Care About Data Modeling for Real-Time Dashboards
2 min read
Refactoring a Mature Airflow Project: A Practical Guide to Scaling from Solo Development to Team Collaboration
4 min read
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Nov 24-Dec 8, 2025)
6 min read