Forem

# dataengineering

Posts

YouTube Data Processing Pipeline

2
Comments 1
4 min read
🔄 ETL vs ELT: What’s the Difference and Why It Matters?

Comments
2 min read
CDC in AWS: Change Data Capture from AWS RDS MySQL into an AWS MSK Kafka topic using Debezium

1
Comments
5 min read
LLPY-03: Intelligent Extraction and Processing of Legal Data

Comments
21 min read
Create a Microsoft Fabric Lakehouse

5
Comments
6 min read
🏗️ The Role of a Data Engineer: Beyond Pipelines

Comments
2 min read
Beyond Flat Tables: Model Hierarchical Data in Supabase with Recursive Queries

2
Comments
7 min read
Personal Picks: Data Product News (October 1, 2025)

Comments
7 min read
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks

4
Comments 1
10 min read
Git Integration in Microsoft Fabric

4
Comments
3 min read
🎯 The Challenge: Processing TBs of S3 Data Without Breaking the Bank

Comments
5 min read
Why Is Apache Iceberg Needed?

1
Comments
6 min read
Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Comments
5 min read
Automating NASA’s Astronomy Picture of the Day with Airflow

Comments
6 min read
10 Best Platforms to Learn Data Analytics in 2026

3
Comments 1
4 min read