Forem

# dataengineering

Posts

DBT (Data Build Tool)

3
Comments 1
4 min read
My Data Engineering Library

Comments
2 min read
Shipping Data in Real Time Debezium : Part 1

4
Comments 4
2 min read
How to Transpose Columns in Each Group to a Single Row

7
Comments
2 min read
XGBoost Training Speed: A Comparative Analysis

Comments
2 min read
Loops and Vectorization in Python

Comments
1 min read
Embarking on the Data Odyssey: A Deep Dive into Data Engineering for Tech Enthusiasts

Comments
3 min read
Apache Doris 2.1.0: TPC-DS, Parallel Adaptive Scan, Local Shuffle, Arrow Flight-based HTTP Data API

Comments
29 min read
RisingWave workshop

2
Comments
5 min read
Production and CI/CD in dbt

2
Comments
3 min read
My Experience with Apache Airflow

9
Comments
3 min read
"Day 42 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Mathematics for Data Analysis (Stats Day -21)

1
Comments
1 min read
Different file formats, a benchmark doing basic operations

10
Comments 2
9 min read
5 reasons Dremio is the ideal Apache Iceberg Lakehouse Platform

Comments
5 min read
When Metrics Go Awry: Analyzing KPIs using machine learning, regression analysis, and Shapley values

Comments
5 min read