Forem

# dataengineering

Posts

Fixing Type Hints for Callable Objects with Custom Signatures in Dagster
3 min read
Getting to Know Apache Spark the Easy Way
1 min read
Building a Test Data Platform After Watching Teams Secretly Use Production for Years
3 min read
Kafka
10 min read
Chinese DBA's Story: Sui Haifeng - Grasp the two most important five-year periods of your career
5 min read
A Modern Data Governance Framework for Google Cloud: Implementing Just-Enough and Just-in-Time Access
8 min read
Scaling Customer Analytics: Designing ML Pipelines for Millions of Users
7 min read
Temperature, Tokens, and Context Windows: The Three Pillars of LLM Control
13 min read
Building Intelligent, Metadata-Driven Pipelines with Azure Data Factory
6 min read
Why Your Enterprise Data Platform Is No Longer Just for Analytics
11 min read
The State of Apache Iceberg, Polaris, and Arrow: October–November 2025
7 min read
Realtime Data Streaming Platform: Building a Unified Monitoring Stack
8 min read
Real-Time Streaming Platform with Pulsar, Flink & ClickHouse
6 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?
3 min read
🎓 Building a Smart LMS Assistant: RAG System with Pinecone for Multi-Source Learning Data
3 min read