Forem

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Create a Microsoft Fabric Lakehouse
Cover image for Create a Microsoft Fabric Lakehouse

Create a Microsoft Fabric Lakehouse

5
Comments
6 min read
Core Concepts of Kafka

Core Concepts of Kafka

Comments
8 min read
LLPY-03: Extracción y Procesamiento Inteligente de Datos Legales
Cover image for LLPY-03: Extracción y Procesamiento Inteligente de Datos Legales

LLPY-03: Extracción y Procesamiento Inteligente de Datos Legales

Comments
21 min read
Beyond Flat Tables: Model Hierarchical Data in Supabase with Recursive Queries
Cover image for Beyond Flat Tables: Model Hierarchical Data in Supabase with Recursive Queries

Beyond Flat Tables: Model Hierarchical Data in Supabase with Recursive Queries

Comments
7 min read
🚀 How I Learned That Thinking “Small” Can Save Hours of SQL Headaches
Cover image for 🚀 How I Learned That Thinking “Small” Can Save Hours of SQL Headaches

🚀 How I Learned That Thinking “Small” Can Save Hours of SQL Headaches

Comments
2 min read
Personal Picks: Data Product News (October 1, 2025)

Personal Picks: Data Product News (October 1, 2025)

Comments
7 min read
Git Integration in Microsoft Fabric

Git Integration in Microsoft Fabric

3
Comments
3 min read
🎯 The Challenge: Processing TBs of S3 Data Without Breaking the Bank

🎯 The Challenge: Processing TBs of S3 Data Without Breaking the Bank

Comments
5 min read
Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.
Cover image for Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Get Started with Fastest SQL Query Engine - Presto C++ (Prestissimo): Beginner Friendly Setup Guide with Docker.

Comments
5 min read
10 Best Platforms to Learn Data Analytics in 2026
Cover image for 10 Best Platforms to Learn Data Analytics in 2026

10 Best Platforms to Learn Data Analytics in 2026

1
Comments
4 min read
Apache Zookeeper: O coordenador de sistemas distribuídos

Apache Zookeeper: O coordenador de sistemas distribuídos

Comments
8 min read
Debezium: Capturando mudanças de dados em tempo real

Debezium: Capturando mudanças de dados em tempo real

Comments
3 min read
Change Data Capture (CDC): Capturando mudanças em tempo real

Change Data Capture (CDC): Capturando mudanças em tempo real

Comments
4 min read
Streams de Dados: Processamento de Informações em Tempo Real

Streams de Dados: Processamento de Informações em Tempo Real

Comments
3 min read
From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks
Cover image for From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks

From Kafka to Clean Tables: Building a Confluent Snowflake Pipeline with Streams & Tasks

3
Comments 1
10 min read
Big Data Analytics with PySpark: A Beginner-Friendly Guide

Big Data Analytics with PySpark: A Beginner-Friendly Guide

1
Comments
4 min read
Usando Funções de Ordem Superior (Higher-Order Functions - HOFs)

Usando Funções de Ordem Superior (Higher-Order Functions - HOFs)

Comments
4 min read
Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies

Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies

1
Comments
5 min read
📘 Foundation Phase Completed - Starting Phase 2 of My Journey
Cover image for 📘 Foundation Phase Completed - Starting Phase 2 of My Journey

📘 Foundation Phase Completed - Starting Phase 2 of My Journey

Comments
3 min read
A Beginner’s Guide to Big Data Analytics with Apache Spark and PySpark

A Beginner’s Guide to Big Data Analytics with Apache Spark and PySpark

Comments
4 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
4 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

Comments
7 min read
10 Best Platforms to Learn Data Engineering in 2026
Cover image for 10 Best Platforms to Learn Data Engineering in 2026

10 Best Platforms to Learn Data Engineering in 2026

Comments
4 min read
Building a Task Manager with Apache NiFi: From Custom Scheduler to Distributed Workflows

Building a Task Manager with Apache NiFi: From Custom Scheduler to Distributed Workflows

4
Comments
9 min read
Azure Data Factory (ADF) - A Beginner's Guide to Cloud Data Integration
Cover image for Azure Data Factory (ADF) - A Beginner's Guide to Cloud Data Integration

Azure Data Factory (ADF) - A Beginner's Guide to Cloud Data Integration

2
Comments
4 min read
loading...