Forem

# dataengineering

Posts

Building a Modern Data Platform to Track Kenya’s Food Prices — A Data Engineering Case Study

Comments
5 min read
Part 1: Database Concepts & Architecture

Comments
14 min read
Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine

Comments
4 min read
I Built an ETL Pipeline That Actually Thinks & And Cut Token Costs by 52% (And Here's What I Learned)

1
Comments
17 min read
Firehose and Iceberg Tables

2
Comments
4 min read
Beyond SQL: Solving Data Warehouse Performance Bottlenecks with Smart Algorithms, Not Just Bigger Clusters

5
Comments
13 min read
From Pandas to Upstream Control: The Evolution PyData Needs Next

Comments
6 min read
Statistics Day 2: Correlation Isn’t Causation — Here’s Why It Matters!

5
Comments
4 min read
Unpacking the Google File System Paper: A Simple Breakdown

6
Comments
3 min read
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Dec 9th - Dec 15th, 2025)

1
Comments
7 min read
Kafka consumer lag—Measure and reduce

Comments
5 min read
Understanding Kafka Consumer Lag: Causes, Risks, and How to Fix It

Comments
3 min read
Building a dbt-UI I Wish Existed

1
Comments
3 min read
Building a Real-Time Crypto Data Pipeline with Debezium CDC

Comments
5 min read
Understanding Kafka Lag, Why It Happens and How To Fix It.

2
Comments
4 min read