Forem

# dataengineering

Posts

The X-Axis of Data: How BigQuery Models the Customer Dimension for Agentic AI Decisions

1
Comments
5 min read
Reproducibility of Analytical Decisions: Introducing a Deterministic Analytical Runtime

1
Comments
3 min read
Why Shannon Entropy Catches What Schema Validation Misses

Comments
5 min read
What is the difference between ETL and ELT?

2
Comments 2
8 min read
dbt snapshots: moving from merges to native history

1
Comments
5 min read
Batch Processing with Apache Spark

Comments
1 min read
Turing Completeness in Reactivity

5
Comments
4 min read
A Beginner's Guide to SQL Joins and Window Functions

1
Comments
6 min read
ETL VS ELT: WHICH ONE SHOULD YOU USE AND WHY?

1
Comments
5 min read
The Backyard Quarry, Part 2: Designing a Schema for Physical Objects

2
Comments
5 min read
dbt docs

1
Comments
7 min read
AWS Lake Formation: Why Your Data Lake Permissions Are Probably a Mess (And How to Fix That)

Comments
3 min read
Data Engineering Interview Prep (2026): What Actually Matters (SQL, Pipelines, System Design)
Prioritizes clear thinking under pressure

74
Comments 12
8 min read
How We Generate AI Network Digests for MegaETH at MiniBlocks.io

1
Comments
8 min read
Understanding Vector Pipelines: From Config Files to Data Flow

2
Comments
3 min read