Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
11 Compaction Optimizations for Iceberg Data Lakes

11 Compaction Optimizations for Iceberg Data Lakes

1
Comments
25 min read
Slowly Changing Dimensions: Types 1-3 with Examples
Cover image for Slowly Changing Dimensions: Types 1-3 with Examples

Slowly Changing Dimensions: Types 1-3 with Examples

Comments
4 min read
Data Modeling for the Lakehouse: What Changes
Cover image for Data Modeling for the Lakehouse: What Changes

Data Modeling for the Lakehouse: What Changes

Comments
4 min read
ELI25: Apache Kafka Quick Notes for Interviews

ELI25: Apache Kafka Quick Notes for Interviews

Comments
4 min read
Postmortem: Eliminating OOM Failures in Spark on Kubernetes (Azure) After Cloud Migration
Cover image for Postmortem: Eliminating OOM Failures in Spark on Kubernetes (Azure) After Cloud Migration

Postmortem: Eliminating OOM Failures in Spark on Kubernetes (Azure) After Cloud Migration

Comments
5 min read
A 2026 Introduction to Apache Iceberg
Cover image for A 2026 Introduction to Apache Iceberg

A 2026 Introduction to Apache Iceberg

Comments
6 min read
Data Is Not a Department — It’s a Decision Architecture
Cover image for Data Is Not a Department — It’s a Decision Architecture

Data Is Not a Department — It’s a Decision Architecture

4
Comments
2 min read
Ditch 10,000 Intermediate Tables—Compute Outside the Database with Open-Source SPL

Ditch 10,000 Intermediate Tables—Compute Outside the Database with Open-Source SPL

5
Comments
8 min read
How Analysts Translate Messy Data, DAX, and Dashboards into Action Using Power BI

How Analysts Translate Messy Data, DAX, and Dashboards into Action Using Power BI

1
Comments
4 min read
Hardcoded Selectors vs. AI Prompts: A Resilience Benchmark on Etsy
Cover image for Hardcoded Selectors vs. AI Prompts: A Resilience Benchmark on Etsy

Hardcoded Selectors vs. AI Prompts: A Resilience Benchmark on Etsy

Comments 1
5 min read
Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Comments
4 min read
From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker
Cover image for From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker

From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker

2
Comments
5 min read
Schema Evolution Without Breaking Consumers
Cover image for Schema Evolution Without Breaking Consumers

Schema Evolution Without Breaking Consumers

1
Comments
4 min read
Data Quality Is a Pipeline Problem, Not a Dashboard Problem
Cover image for Data Quality Is a Pipeline Problem, Not a Dashboard Problem

Data Quality Is a Pipeline Problem, Not a Dashboard Problem

1
Comments
4 min read
Apache Data Lakehouse Weekly: February 4-11, 2026
Cover image for Apache Data Lakehouse Weekly: February 4-11, 2026

Apache Data Lakehouse Weekly: February 4-11, 2026

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.