Forem

# dataengineering

Posts

Testing with Monkey Patching

Comments
4 min read
Automated Google News Search

3
Comments
1 min read
Aggregation Strategies for Scalable Data Insights: A Technical Perspective

3
Comments
5 min read
🔄 ETL vs ELT: The Backbone of Data Engineering

2
Comments
1 min read
🚀Git + Databricks: Why Both Are Essential for Modern Data Engineering

1
Comments
2 min read
🚀 Synthetic Data: The Next Frontier for Data Engineers

Comments
2 min read
Pytest: How to Test Python Modules with Top-Level Configuration

Comments
5 min read
Databend Monthly Report: July 2025

Comments
3 min read
Scaling Databases with ClickHouse Sharding (Hands-On Simulation)

3
Comments
2 min read
Building AI-Powered Data Pipelines: Where Data Engineering Meets Machine Learning

Comments
2 min read
wget vs. curl: when to use which?

Comments
2 min read
Where We Encounter Delimited Data and How We Handle It

2
Comments
6 min read
Why Apache Airflow is the Cornerstone of Modern Data Engineering

Comments
5 min read
🚀 How PySpark Helps Handle Terabytes of Data Easily

Comments
2 min read
Building a Data Mart in Amazon Redshift: A Practical Guide

Comments
6 min read
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices

1
Comments 1
6 min read
Apache Arrow dev list digest (Aug 25–29 2025)

Comments
4 min read
(I) Principles of Data Model Architecture: Four Layers and Seven Stages

5
Comments
7 min read
Revamping Real-Time Data Ingestion for Scalable Media Intelligence

Comments
4 min read
Scraping the Schema of NetSuite

Comments
2 min read
You Can't Trust COUNT and SUM: Scalable Data Validation with Merkle Trees

2
Comments 1
8 min read
Docker for Data Engineers: The Complete Beginner’s Guide

4
Comments
6 min read
Dynamic Routing Lightweight ETL with AWS Lambda, DuckDB, and PyIceberg

5
Comments 4
4 min read
Lightweight ETL with AWS Lambda, DuckDB, and delta-rs

3
Comments
5 min read
🚀 The Future of Data Engineering: How AI and Automation are Changing the Game

Comments
2 min read