Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The Three Phases of Data Pipelines

The Three Phases of Data Pipelines

Comments
4 min read
Schemas and Data Modelling in Power B.I
Cover image for Schemas and Data Modelling in Power B.I

Schemas and Data Modelling in Power B.I

1
Comments
3 min read
From Data Chaos to Executable Graphs: Turning Relationships into Infrastructure

From Data Chaos to Executable Graphs: Turning Relationships into Infrastructure

4
Comments 1
2 min read
Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti
Cover image for Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti

Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti

Comments
6 min read
Why Columnar Storage Makes Analytics Faster

Why Columnar Storage Makes Analytics Faster

1
Comments
1 min read
Are Wide Tables Fast or Slow?

Are Wide Tables Fast or Slow?

5
Comments
4 min read
HOW TO GIT IT

HOW TO GIT IT

Comments
3 min read
Data Engineering Basics: From What is Data to Modern Lakehouse Architecture

Data Engineering Basics: From What is Data to Modern Lakehouse Architecture

7
Comments
4 min read
Your ML Model Isn’t Wrong. Your SQL Probably Is.
Cover image for Your ML Model Isn’t Wrong. Your SQL Probably Is.

Your ML Model Isn’t Wrong. Your SQL Probably Is.

Comments 1
2 min read
Traditional vs Modern Data Architecture

Traditional vs Modern Data Architecture

7
Comments
4 min read
Tableau + Databricks at Scale: A Technical Guide for Managing 10,000+ Databases

Tableau + Databricks at Scale: A Technical Guide for Managing 10,000+ Databases

Comments
5 min read
Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk
Cover image for Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk

Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk

1
Comments
3 min read
Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026
Cover image for Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026

Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026

Comments
8 min read
From Silent None to Insight: Debugging PySpark UDFs on AWS Glue with Decorators

From Silent None to Insight: Debugging PySpark UDFs on AWS Glue with Decorators

Comments
8 min read
Dimensional Modeling: Facts, Dimensions, and Grains
Cover image for Dimensional Modeling: Facts, Dimensions, and Grains

Dimensional Modeling: Facts, Dimensions, and Grains

2
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.