Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Designing a Layered YouTube Analytics Pipeline with AWS Bedrock (Architecture Overview)
Cover image for Designing a Layered YouTube Analytics Pipeline with AWS Bedrock (Architecture Overview)

Designing a Layered YouTube Analytics Pipeline with AWS Bedrock (Architecture Overview)

Comments
2 min read
Understanding schemas and data modelling in Power BI
Cover image for Understanding schemas and data modelling in Power BI

Understanding schemas and data modelling in Power BI

2
Comments
4 min read
Scaling Fuzzy Matching: From Local Scripts to Production Pipelines

Scaling Fuzzy Matching: From Local Scripts to Production Pipelines

7
Comments
5 min read
Offloading Statistical Computations to BigQuery: Efficient EDA with Python and Seaborn

Offloading Statistical Computations to BigQuery: Efficient EDA with Python and Seaborn

1
Comments
2 min read
AI Data Engineer Skills Deep-Dive: Entry-Level Reality + Senior Differentiators (Follow-up to Part 1)
Cover image for AI Data Engineer Skills Deep-Dive: Entry-Level Reality + Senior Differentiators (Follow-up to Part 1)

AI Data Engineer Skills Deep-Dive: Entry-Level Reality + Senior Differentiators (Follow-up to Part 1)

Comments
4 min read
Why Data Teams Still “Guess” Join Keys in 2026

Why Data Teams Still “Guess” Join Keys in 2026

Comments 1
2 min read
How DiDi Scaled to Hundreds of Petabytes with Apache Ozone

How DiDi Scaled to Hundreds of Petabytes with Apache Ozone

Comments
4 min read
XLTable: Bringing the OLAP Experience Back to Excel on Modern Data Warehouses

XLTable: Bringing the OLAP Experience Back to Excel on Modern Data Warehouses

Comments
4 min read
Stop Bad Data From Breaking Your Pipelines — A Python Data Quality Framework
Cover image for Stop Bad Data From Breaking Your Pipelines — A Python Data Quality Framework

Stop Bad Data From Breaking Your Pipelines — A Python Data Quality Framework

Comments
3 min read
How to Implement Data Modelling in Power BI
Cover image for How to Implement Data Modelling in Power BI

How to Implement Data Modelling in Power BI

2
Comments
2 min read
AI Data Engineer vs Data Engineer: What Actually Changed? (50+ Job Analysis)
Cover image for AI Data Engineer vs Data Engineer: What Actually Changed? (50+ Job Analysis)

AI Data Engineer vs Data Engineer: What Actually Changed? (50+ Job Analysis)

Comments
4 min read
Designing a Modern Data Warehouse: Combining Bill Inmon and Ralph Kimball in a Hybrid Medallion Architecture

Designing a Modern Data Warehouse: Combining Bill Inmon and Ralph Kimball in a Hybrid Medallion Architecture

2
Comments
3 min read
Mitigating 'Scraping Shock': Engineering Cost-Aware Data Pipelines
Cover image for Mitigating 'Scraping Shock': Engineering Cost-Aware Data Pipelines

Mitigating 'Scraping Shock': Engineering Cost-Aware Data Pipelines

Comments
5 min read
JSONPath Is In! The AI Assistant Will See You Now
Cover image for JSONPath Is In! The AI Assistant Will See You Now

JSONPath Is In! The AI Assistant Will See You Now

Comments
4 min read
How I automated MongoDB JSON Flattening for Analytics (No ETL)
Cover image for How I automated MongoDB JSON Flattening for Analytics (No ETL)

How I automated MongoDB JSON Flattening for Analytics (No ETL)

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.