Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance
Cover image for Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance

7
Comments
6 min read
Ask Our AI Experts: An AMA With Our Tech Leads
Cover image for Ask Our AI Experts: An AMA With Our Tech Leads

Ask Our AI Experts: An AMA With Our Tech Leads

Comments
3 min read
11 Compaction Optimizations for Iceberg Data Lakes

11 Compaction Optimizations for Iceberg Data Lakes

1
Comments
25 min read
Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI
Cover image for Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

Garbage In, Powerhouse Out? (Nope.) Why Your Data Foundation Matters More Than AI

1
Comments
4 min read
Schemas and Data Modelling in Power BI

Schemas and Data Modelling in Power BI

3
Comments
7 min read
Configuring Gravitino Iceberg REST Catalog Server
Cover image for Configuring Gravitino Iceberg REST Catalog Server

Configuring Gravitino Iceberg REST Catalog Server

3
Comments 1
6 min read
Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking

Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking

5
Comments 1
2 min read
Analytics Engineering

Analytics Engineering

Comments
1 min read
We All Accepted the "Python Tax.", Pandas 3.0 Just Reduced It.
Cover image for We All Accepted the "Python Tax.", Pandas 3.0 Just Reduced It.

We All Accepted the "Python Tax.", Pandas 3.0 Just Reduced It.

5
Comments
2 min read
Data Relationships Are a First-Class Problem in Modern Data Systems

Data Relationships Are a First-Class Problem in Modern Data Systems

5
Comments 1
2 min read
Your 2026 Resolution: Add Context to Your Data (Before It Breaks You)

Your 2026 Resolution: Add Context to Your Data (Before It Breaks You)

Comments
10 min read
The Natasha Problem: Why Your Data Pipeline Only Fits One Person

The Natasha Problem: Why Your Data Pipeline Only Fits One Person

Comments
5 min read
A Pragmatic, Event-Driven Serverless Data Architecture
Cover image for A Pragmatic, Event-Driven Serverless Data Architecture

A Pragmatic, Event-Driven Serverless Data Architecture

5
Comments
4 min read
Before Big Data: 3 Key Discoveries That Changed Business Strategy Forever

Before Big Data: 3 Key Discoveries That Changed Business Strategy Forever

Comments
4 min read
A 2026 Introduction to Apache Iceberg
Cover image for A 2026 Introduction to Apache Iceberg

A 2026 Introduction to Apache Iceberg

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.