Forem

# dataengineering

Posts

How DiDi Scaled to Hundreds of Petabytes with Apache Ozone

4 min read
Building Scalable Data Pipelines with Airflow, Docker, and Python: A SightSearch Case Study

3 min read
Introduction to Linux for Data Engineers, Beginner Friendly Approach

2 min read
XLTable: Bringing the OLAP Experience Back to Excel on Modern Data Warehouses

4 min read
How to Implement Data Modelling in Power BI

2 min read
AI Data Engineer vs Data Engineer: What Actually Changed? (50+ Job Analysis)

4 min read
SellerSprite Alternative: Building a Cost-Effective Amazon Data Pipeline with Pangolinfo API

6 min read
Mitigating 'Scraping Shock': Engineering Cost-Aware Data Pipelines

5 min read
JSONPath Is In! The AI Assistant Will See You Now

4 min read
How I automated MongoDB JSON Flattening for Analytics (No ETL)

2 min read
Custom Functions FTW

4 min read
Apache Data Lakehouse Weekly: January 20–27, 2026

5 min read
Observability: Monitoring Spiders with Prometheus and Grafana

12 min read
SCHEMAS AND DATA MODELLING IN POWER BI

1 comment
2 min read
Comparing Validatar to CsvPath Validation

7 min read