Forem

# dataengineering

Posts

The Three Phases of Data Pipelines

Comments
4 min read
Schemas and Data Modelling in Power B.I

1
Comments
3 min read
Architecture of a 6TB Media Pipeline: Engineering Real-Time Content at Bharat Drone Shakti

Comments
6 min read
Why Columnar Storage Makes Analytics Faster

1
Comments
1 min read
Are Wide Tables Fast or Slow?

5
Comments
4 min read
HOW TO GIT IT

Comments
3 min read
Your ML Model Isn’t Wrong. Your SQL Probably Is.

Comments 1
2 min read
Tableau + Databricks at Scale: A Technical Guide for Managing 10,000+ Databases

Comments
5 min read
Making AI Data Flows Visible: Building an Open-Source Tool to Understand SaaS & LLM Data Risk

1
Comments
3 min read
Deploying Prefect Workflows on Cloud Run with Cloud SQL (Production-Ready GCP Setup)

1
Comments
1 min read
An Introduction to Git: Concepts, Commands, and Workflows

Comments
4 min read
Apache Iceberg & the Open Data Stack: Why the Lakehouse is Real in 2026

Comments
8 min read
Learning Git & GitHub as a Data Engineering Student at LuxDevHQ

Comments
3 min read
From Silent None to Insight: Debugging PySpark UDFs on AWS Glue with Decorators

Comments
8 min read
Dimensional Modeling: Facts, Dimensions, and Grains

Comments
4 min read