Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Testing Data Pipelines: What to Validate and When
Cover image for Testing Data Pipelines: What to Validate and When

Testing Data Pipelines: What to Validate and When

1
Comments
4 min read
From AI Visibility to AI Governance: Building a Local-First LLM Cost & Risk Optimizer
Cover image for From AI Visibility to AI Governance: Building a Local-First LLM Cost & Risk Optimizer

From AI Visibility to AI Governance: Building a Local-First LLM Cost & Risk Optimizer

Comments
4 min read
Data Engineering ZoomCamp Module 1 Notes Part 1

Data Engineering ZoomCamp Module 1 Notes Part 1

1
Comments
3 min read
Streaming Crypto Changes: A Practical Guide to Real-Time Data Pipelines with Debezium CDC

Streaming Crypto Changes: A Practical Guide to Real-Time Data Pipelines with Debezium CDC

Comments
3 min read
Fuzzy-match millions of rows in Databricks (2026)
Cover image for Fuzzy-match millions of rows in Databricks (2026)

Fuzzy-match millions of rows in Databricks (2026)

9
Comments
5 min read
AI Agents in Data Analytics: How Adeloop Bridges Autonomous Intelligence and Users
Cover image for AI Agents in Data Analytics: How Adeloop Bridges Autonomous Intelligence and Users

AI Agents in Data Analytics: How Adeloop Bridges Autonomous Intelligence and Users

1
Comments 1
2 min read
Lakehouse? More Like a Lake + Warehouse Parking Lot

Lakehouse? More Like a Lake + Warehouse Parking Lot

5
Comments
10 min read
Why AI Models Fail in Production — Even When Accuracy Looks High

Why AI Models Fail in Production — Even When Accuracy Looks High

Comments
1 min read
🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler

🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler

Comments
3 min read
From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community
Cover image for From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community

From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community

Comments
2 min read
Manual Relationship Discovery Does Not Scale.Not Even With SQL.

Manual Relationship Discovery Does Not Scale.Not Even With SQL.

Comments 1
2 min read
Building an Automated Data Pipeline
Cover image for Building an Automated Data Pipeline

Building an Automated Data Pipeline

Comments
2 min read
Linux for Data Engineers: From Terminal to Text Editing

Linux for Data Engineers: From Terminal to Text Editing

Comments
16 min read
Building Production ETL Pipelines in Node.js with HazelJS Data
Cover image for Building Production ETL Pipelines in Node.js with HazelJS Data

Building Production ETL Pipelines in Node.js with HazelJS Data

Comments
9 min read
Scaling Relationship Discovery Across 100,000+ Fields Without Breaking Compute

Scaling Relationship Discovery Across 100,000+ Fields Without Breaking Compute

1
Comments 1
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.