Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Fuzzy-match millions of rows in Databricks (2026)
Cover image for Fuzzy-match millions of rows in Databricks (2026)

Fuzzy-match millions of rows in Databricks (2026)

9
Comments
5 min read
Lakehouse? More Like a Lake + Warehouse Parking Lot

Lakehouse? More Like a Lake + Warehouse Parking Lot

5
Comments
10 min read
Data Modeling Best Practices: 7 Mistakes to Avoid
Cover image for Data Modeling Best Practices: 7 Mistakes to Avoid

Data Modeling Best Practices: 7 Mistakes to Avoid

Comments
4 min read
Why Your AI Initiatives Fail Without a Semantic Layer
Cover image for Why Your AI Initiatives Fail Without a Semantic Layer

Why Your AI Initiatives Fail Without a Semantic Layer

Comments
4 min read
Semantic Layer vs. Metrics Layer: What's the Difference?
Cover image for Semantic Layer vs. Metrics Layer: What's the Difference?

Semantic Layer vs. Metrics Layer: What's the Difference?

Comments
4 min read
Semantic Layer vs. Data Catalog: Complementary, Not Competing
Cover image for Semantic Layer vs. Data Catalog: Complementary, Not Competing

Semantic Layer vs. Data Catalog: Complementary, Not Competing

Comments
4 min read
How to Build a Semantic Layer: A Step-by-Step Guide
Cover image for How to Build a Semantic Layer: A Step-by-Step Guide

How to Build a Semantic Layer: A Step-by-Step Guide

3
Comments
4 min read
What Is a Semantic Layer? A Complete Guide
Cover image for What Is a Semantic Layer? A Complete Guide

What Is a Semantic Layer? A Complete Guide

Comments
4 min read
Data Vault Modeling: Hubs, Links, and Satellites
Cover image for Data Vault Modeling: Hubs, Links, and Satellites

Data Vault Modeling: Hubs, Links, and Satellites

Comments
4 min read
Data Modeling for Analytics: Optimize for Queries, Not Transactions
Cover image for Data Modeling for Analytics: Optimize for Queries, Not Transactions

Data Modeling for Analytics: Optimize for Queries, Not Transactions

Comments
4 min read
🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler

🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler

Comments
3 min read
From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community
Cover image for From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community

From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community

Comments
2 min read
Denormalization: When and Why to Flatten Your Data
Cover image for Denormalization: When and Why to Flatten Your Data

Denormalization: When and Why to Flatten Your Data

Comments
4 min read
Building Production ETL Pipelines in Node.js with HazelJS Data
Cover image for Building Production ETL Pipelines in Node.js with HazelJS Data

Building Production ETL Pipelines in Node.js with HazelJS Data

5
Comments
9 min read
Ten years late to the dbt party (DuckDB edition)
Cover image for Ten years late to the dbt party (DuckDB edition)

Ten years late to the dbt party (DuckDB edition)

Comments
27 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.