Forem

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
One Off to One Data Platform: Design with Intent [Part 2]
Cover image for One Off to One Data Platform: Design with Intent [Part 2]

One Off to One Data Platform: Design with Intent [Part 2]

1
Comments
5 min read
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem
Cover image for Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem

Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem

Comments
2 min read
Introduction to Apache Kafka

Introduction to Apache Kafka

3
Comments 1
3 min read
Seaborn Cheat Sheet
Cover image for Seaborn Cheat Sheet

Seaborn Cheat Sheet

1
Comments
2 min read
Jupyter Notebooks in Docker
Cover image for Jupyter Notebooks in Docker

Jupyter Notebooks in Docker

11
Comments 1
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines

4
Comments 1
3 min read
ACID Properties in Databases: What Happens Without Them?
Cover image for ACID Properties in Databases: What Happens Without Them?

ACID Properties in Databases: What Happens Without Them?

5
Comments
6 min read
10 Future Apache Iceberg Developments to Look forward to in 2025
Cover image for 10 Future Apache Iceberg Developments to Look forward to in 2025

10 Future Apache Iceberg Developments to Look forward to in 2025

1
Comments
13 min read
Data Architecture Best Practices
Cover image for Data Architecture Best Practices

Data Architecture Best Practices

1
Comments
6 min read
🚀 Unlock the Power of ORC File Format 📊

🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
Setting up memory for Flink - Configuration

Setting up memory for Flink - Configuration

Comments
3 min read
Designing robust and scalable relational databases: A series of best practices.
Cover image for Designing robust and scalable relational databases: A series of best practices.

Designing robust and scalable relational databases: A series of best practices.

18
Comments 5
17 min read
From Data to Decisions: How Machine Learning Works in 2025

From Data to Decisions: How Machine Learning Works in 2025

3
Comments
3 min read
Why Data Security is Broken and How to Fix it?
Cover image for Why Data Security is Broken and How to Fix it?

Why Data Security is Broken and How to Fix it?

1
Comments
5 min read
Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Mastering Dynamic Allocation in Apache Spark: A Practical Guide with Real-World Insights

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.