# dataengineering

Posts

Building Something Radical for Data Infrastructure

2
Comments
1 min read
Using Data Engineering to Track Food Prices and Inflation in Kenya from 2006 to 2025

9
Comments
6 min read
Is your Vector Database Really Fast?

1
Comments
9 min read
Big Data Fundamentals: big data tutorial

5
Comments
5 min read
Big Data Fundamentals: big data tutorial

5
Comments
6 min read
Big Data Fundamentals: big data tutorial

5
Comments
5 min read
Slowly Changing Dimensions: Strategies for Maintaining History and Integrity in Analytical Systems

1
Comments
8 min read
Big Data Fundamentals: spark tutorial

1
Comments
6 min read
Classes in Python, a beginner's pov

1
Comments
2 min read
Generate Blazing-Fast Ad-Hoc Python Functions From Declarative Rules

Comments
2 min read
🧱 OLTP vs OLAP: When Transaction Meets Analytics

2
Comments
2 min read
Building a Real-Time Healthcare Data Pipeline with Apache Spark: From SQS to Parquet (Part 2)

Comments
8 min read
Virtual Private Database (VPD) | DBMS_RLS | fine-grained access control (FGAC) | mrcaption49

5
Comments
5 min read
Kafka Internal Architecture and Mechanisms

Comments
14 min read
DBMS_SCHEDULER with Practical example | mrcaption49

5
Comments
4 min read
🖼️ PixelSink: Hunt Hidden Data Inside Images

1
Comments
1 min read
SQL Server 2025 - What’s New and How to Visualize the Schema

14
Comments 1
7 min read
Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery

Comments
3 min read
Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg

Comments
3 min read
Apache Iceberg Table Optimization #2: The Basics of Compaction — Bin Packing Your Data for Efficiency

Comments
3 min read
🧪 Virtual Environments for Data Engineers — 2025 Edition

Comments
1 min read
Big Data Fundamentals: big data tutorial

1
Comments
5 min read
Big Data Fundamentals: big data tutorial

1
Comments
5 min read
Apache Iceberg Table Optimization #5: Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests

Comments
3 min read