Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Day 15: Running Spark in the Cloud - Dataproc vs Databricks
Cover image for Day 15: Running Spark in the Cloud - Dataproc vs Databricks

Day 15: Running Spark in the Cloud - Dataproc vs Databricks

Comments
2 min read
Rethinking Stream-Batch Unification: Real-Time Processing with Incremental Materialized Views in Apache Cloudberry

Rethinking Stream-Batch Unification: Real-Time Processing with Incremental Materialized Views in Apache Cloudberry

Comments
5 min read
Interesting links - December 2025

Interesting links - December 2025

Comments
13 min read
Data Engineering Processes: From Raw Data to Cleaned, Processed, Analytics-Ready Data.
Cover image for Data Engineering Processes: From Raw Data to Cleaned, Processed, Analytics-Ready Data.

Data Engineering Processes: From Raw Data to Cleaned, Processed, Analytics-Ready Data.

Comments
5 min read
Navigating the Future: Top Data Engineering Trends Shaping 2024 and Beyond

Navigating the Future: Top Data Engineering Trends Shaping 2024 and Beyond

Comments
4 min read
Apache Airflow: Complete Guide for Basic to Advanced Developers
Cover image for Apache Airflow: Complete Guide for Basic to Advanced Developers

Apache Airflow: Complete Guide for Basic to Advanced Developers

1
Comments
22 min read
Day 14: Building a Real Retail Analytics Pipeline Using Spark Window Functions
Cover image for Day 14: Building a Real Retail Analytics Pipeline Using Spark Window Functions

Day 14: Building a Real Retail Analytics Pipeline Using Spark Window Functions

Comments
1 min read
How to Set Up GPG Keys for an Existing GitHub Account (Step-by-Step)

How to Set Up GPG Keys for an Existing GitHub Account (Step-by-Step)

Comments
2 min read
LET'S GIT IT—A Beginner's Guide to Version Control.
Cover image for LET'S GIT IT—A Beginner's Guide to Version Control.

LET'S GIT IT—A Beginner's Guide to Version Control.

4
Comments 1
3 min read
Day 13: Window Functions in PySpark
Cover image for Day 13: Window Functions in PySpark

Day 13: Window Functions in PySpark

Comments
2 min read
Introduction to Version Control with Git and GitHub

Introduction to Version Control with Git and GitHub

1
Comments 2
3 min read
Is CsvPath an easy or hard language?
Cover image for Is CsvPath an easy or hard language?

Is CsvPath an easy or hard language?

Comments
16 min read
Git for Beginners: What It Is, Why It Matters, and How to Use It with GitHub

Git for Beginners: What It Is, Why It Matters, and How to Use It with GitHub

3
Comments
2 min read
Day 17: Building a Real ETL Pipeline in Spark Using Bronze-Silver-Gold Architecture
Cover image for Day 17: Building a Real ETL Pipeline in Spark Using Bronze-Silver-Gold Architecture

Day 17: Building a Real ETL Pipeline in Spark Using Bronze-Silver-Gold Architecture

Comments
1 min read
Understanding Salesforce Data 360 Objects: The Core of the Unified Customer Profile
Cover image for Understanding Salesforce Data 360 Objects: The Core of the Unified Customer Profile

Understanding Salesforce Data 360 Objects: The Core of the Unified Customer Profile

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.