Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Testing Data Pipelines: What to Validate and When
Alex Merced
Alex Merced
Alex Merced
Follow
Feb 25
Testing Data Pipelines: What to Validate and When
#
data
#
dataengineering
#
softwareengineering
#
testing
1
 reaction
Comments
Add Comment
4 min read
From AI Visibility to AI Governance: Building a Local-First LLM Cost & Risk Optimizer
Harris Bashir
Harris Bashir
Harris Bashir
Follow
Feb 5
From AI Visibility to AI Governance: Building a Local-First LLM Cost & Risk Optimizer
#
ai
#
dataengineering
#
opensource
#
governance
Comments
Add Comment
4 min read
Data Engineering ZoomCamp Module 1 Notes Part 1
Abdelrahman Adnan
Abdelrahman Adnan
Abdelrahman Adnan
Follow
Jan 27
Data Engineering ZoomCamp Module 1 Notes Part 1
#
beginners
#
dataengineering
#
docker
#
tutorial
1
 reaction
Comments
Add Comment
3 min read
Streaming Crypto Changes: A Practical Guide to Real-Time Data Pipelines with Debezium CDC
J M
J M
J M
Follow
Jan 22
Streaming Crypto Changes: A Practical Guide to Real-Time Data Pipelines with Debezium CDC
#
dataengineering
#
data
#
docker
Comments
Add Comment
3 min read
Fuzzy-match millions of rows in Databricks (2026)
Siyana Hristova
Siyana Hristova
Siyana Hristova
Follow
Feb 25
Fuzzy-match millions of rows in Databricks (2026)
#
datascience
#
dataengineering
#
databricks
#
bigdata
9
 reactions
Comments
Add Comment
5 min read
AI Agents in Data Analytics: How Adeloop Bridges Autonomous Intelligence and Users
Adeloop
Adeloop
Adeloop
Follow
Feb 25
AI Agents in Data Analytics: How Adeloop Bridges Autonomous Intelligence and Users
#
ai
#
dataengineering
#
architecture
#
agents
1
 reaction
Comments
1
 comment
2 min read
Lakehouse? More Like a Lake + Warehouse Parking Lot
Judy
Judy
Judy
Follow
Jan 22
Lakehouse? More Like a Lake + Warehouse Parking Lot
#
architecture
#
data
#
database
#
dataengineering
5
 reactions
Comments
Add Comment
10 min read
Why AI Models Fail in Production — Even When Accuracy Looks High
Naanhe Gujral
Naanhe Gujral
Naanhe Gujral
Follow
Jan 22
Why AI Models Fail in Production — Even When Accuracy Looks High
#
ai
#
machinelearning
#
devops
#
dataengineering
Comments
Add Comment
1 min read
🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler
Siddhesh Surve
Siddhesh Surve
Siddhesh Surve
Follow
Jan 22
🕸️ I Just Deleted My Scraper Boilerplate: Meet the "One-Liner" Crawler
#
python
#
opensource
#
productivity
#
dataengineering
Comments
Add Comment
3 min read
From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community
maureen chepkirui
maureen chepkirui
maureen chepkirui
Follow
Jan 21
From Splicing Fibers to Scaling Clouds: My Journey to the AWS Community
#
aws
#
cloudcomputing
#
dataengineering
#
fiberoptics
Comments
Add Comment
2 min read
Manual Relationship Discovery Does Not Scale.Not Even With SQL.
Hello Arisyn
Hello Arisyn
Hello Arisyn
Follow
Feb 4
Manual Relationship Discovery Does Not Scale.Not Even With SQL.
#
dataengineering
#
dataintegration
#
dataarchitecture
#
scalability
Comments
1
 comment
2 min read
Building an Automated Data Pipeline
maureen chepkirui
maureen chepkirui
maureen chepkirui
Follow
Jan 21
Building an Automated Data Pipeline
#
aws
#
dataengineering
#
python
#
cloud
Comments
Add Comment
2 min read
Linux for Data Engineers: From Terminal to Text Editing
Edmund Eryuba
Edmund Eryuba
Edmund Eryuba
Follow
Jan 25
Linux for Data Engineers: From Terminal to Text Editing
#
linux
#
dataengineering
#
opensource
Comments
Add Comment
16 min read
Building Production ETL Pipelines in Node.js with HazelJS Data
Muhammad Arslan
Muhammad Arslan
Muhammad Arslan
Follow
Feb 23
Building Production ETL Pipelines in Node.js with HazelJS Data
#
dataengineering
#
node
#
tutorial
#
typescript
Comments
Add Comment
9 min read
Scaling Relationship Discovery Across 100,000+ Fields Without Breaking Compute
Hello Arisyn
Hello Arisyn
Hello Arisyn
Follow
Feb 23
Scaling Relationship Discovery Across 100,000+ Fields Without Breaking Compute
#
scalablesystems
#
dataengineering
#
distributedsystems
#
bigdata
1
 reaction
Comments
1
 comment
2 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account