Forem

# bigdata

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
A Real-World Approach to Splitting Analytics Workloads Between Databricks and Trino

A Real-World Approach to Splitting Analytics Workloads Between Databricks and Trino

Comments
2 min read
BigQuery Sharing: An Underrated Data Exchange Platform You Should Know

BigQuery Sharing: An Underrated Data Exchange Platform You Should Know

Comments
4 min read
Part 1 | A Scheduler Is More Than Just a “Timer”

Part 1 | A Scheduler Is More Than Just a “Timer”

Comments
4 min read
How to Implement Data Modelling in Power BI
Cover image for How to Implement Data Modelling in Power BI

How to Implement Data Modelling in Power BI

2
Comments
2 min read
Designing a Cross-Cloud Data Plane with Apache Iceberg
Cover image for Designing a Cross-Cloud Data Plane with Apache Iceberg

Designing a Cross-Cloud Data Plane with Apache Iceberg

2
Comments
5 min read
Arisyn: Rebuilding Data Relationship Discovery as Infrastructure

Arisyn: Rebuilding Data Relationship Discovery as Infrastructure

1
Comments 1
3 min read
Overview of Real-Time Data Synchronization from PostgreSQL to VeloDB

Overview of Real-Time Data Synchronization from PostgreSQL to VeloDB

Comments
5 min read
Bigtable vs BigQuery: What’s the difference? (2026 Guide)
Cover image for Bigtable vs BigQuery: What’s the difference? (2026 Guide)

Bigtable vs BigQuery: What’s the difference? (2026 Guide)

5
Comments
4 min read
Apache Iceberg Explained: From Data Lakes to Metadata, Snapshots, and Real-World Usage
Cover image for Apache Iceberg Explained: From Data Lakes to Metadata, Snapshots, and Real-World Usage

Apache Iceberg Explained: From Data Lakes to Metadata, Snapshots, and Real-World Usage

2
Comments
4 min read
SeaTunnel CDC Explained: A Layman’s Guide

SeaTunnel CDC Explained: A Layman’s Guide

Comments
7 min read
Deep Dive into SeaTunnel Metadata Caching: The Underlying Logic Supporting Tens of Thousands of Concurrent Tasks

Deep Dive into SeaTunnel Metadata Caching: The Underlying Logic Supporting Tens of Thousands of Concurrent Tasks

Comments
5 min read
Why Apache Ozone is the Preferred Object Store for Big Data
Cover image for Why Apache Ozone is the Preferred Object Store for Big Data

Why Apache Ozone is the Preferred Object Store for Big Data

Comments
3 min read
Exploring Dynamic Return Types in PySpark pandas_udf

Exploring Dynamic Return Types in PySpark pandas_udf

Comments
2 min read
Day 30: From Zero to Production-Ready Spark Data Engineer
Cover image for Day 30: From Zero to Production-Ready Spark Data Engineer

Day 30: From Zero to Production-Ready Spark Data Engineer

Comments
2 min read
Day 28: Spark Streaming Performance Tuning
Cover image for Day 28: Spark Streaming Performance Tuning

Day 28: Spark Streaming Performance Tuning

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.