Forem

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Roadmap de Engenharia de Dados para 2025
Cover image for Roadmap de Engenharia de Dados para 2025

Roadmap de Engenharia de Dados para 2025

27
Comments 1
14 min read
⚡ 3 ClickHouse Query Optimization Lessons I Learned the Hard Way
Cover image for ⚡ 3 ClickHouse Query Optimization Lessons I Learned the Hard Way

⚡ 3 ClickHouse Query Optimization Lessons I Learned the Hard Way

5
Comments
3 min read
Real-Time Data Sync: 4 Questions We Get All the Time
Cover image for Real-Time Data Sync: 4 Questions We Get All the Time

Real-Time Data Sync: 4 Questions We Get All the Time

Comments
4 min read
Unity Catalog in Azure Databricks — Everything You Need to Know

Unity Catalog in Azure Databricks — Everything You Need to Know

Comments
2 min read
🧭 Data Mesh vs Data Fabric (Part 1) – Rethinking How We Scale Data

🧭 Data Mesh vs Data Fabric (Part 1) – Rethinking How We Scale Data

1
Comments
2 min read
Building Something Radical for Data Infrastructure
Cover image for Building Something Radical for Data Infrastructure

Building Something Radical for Data Infrastructure

2
Comments
1 min read
Using Data Engineering to Track Food Prices and Inflation in Kenya from 2006 to 2025
Cover image for Using Data Engineering to Track Food Prices and Inflation in Kenya from 2006 to 2025

Using Data Engineering to Track Food Prices and Inflation in Kenya from 2006 to 2025

9
Comments
6 min read
Is your Vector Database Really Fast?
Cover image for Is your Vector Database Really Fast?

Is your Vector Database Really Fast?

1
Comments
9 min read
Big Data Fundamentals: big data tutorial

Big Data Fundamentals: big data tutorial

5
Comments
5 min read
Big Data Fundamentals: big data tutorial

Big Data Fundamentals: big data tutorial

5
Comments
6 min read
Big Data Fundamentals: big data tutorial

Big Data Fundamentals: big data tutorial

5
Comments
5 min read
Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems
Cover image for Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

1
Comments
8 min read
Big Data Fundamentals: spark tutorial

Big Data Fundamentals: spark tutorial

1
Comments
6 min read
Classes in Python, a beginner's pov

Classes in Python, a beginner's pov

1
Comments
2 min read
Generate Blazing-Fast Ad-Hoc Python Functions From Declarative Rules

Generate Blazing-Fast Ad-Hoc Python Functions From Declarative Rules

Comments
2 min read
🧱 OLTP vs OLAP: When Transaction Meets Analytics

🧱 OLTP vs OLAP: When Transaction Meets Analytics

2
Comments
2 min read
Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Comments
8 min read
Virtual Private Database (VPD) | DBMS_RLS | fine-grained access control (FGAC) | mrcaption49

Virtual Private Database (VPD) | DBMS_RLS | fine-grained access control (FGAC) | mrcaption49

5
Comments
5 min read
Kafka Internal Architecture and Mechanisms
Cover image for Kafka Internal Architecture and Mechanisms

Kafka Internal Architecture and Mechanisms

Comments
14 min read
DBMS_SCHEDULER with Practical example | mrcaption49

DBMS_SCHEDULER with Practical example | mrcaption49

5
Comments
4 min read
🖼️ PixelSink: Hunt Hidden Data Inside Images
Cover image for 🖼️ PixelSink: Hunt Hidden Data Inside Images

🖼️ PixelSink: Hunt Hidden Data Inside Images

1
Comments
1 min read
SQL Server 2025 - What’s New and How to Visualize the Schema
Cover image for SQL Server 2025 - What’s New and How to Visualize the Schema

SQL Server 2025 - What’s New and How to Visualize the Schema

14
Comments 1
7 min read
Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg
Cover image for Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery
Cover image for Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Comments
3 min read
Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg
Cover image for Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Comments
3 min read
loading...