Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
bigdata
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
A Real-World Approach to Splitting Analytics Workloads Between Databricks and Trino
angga faizul
angga faizul
angga faizul
Follow
Feb 12
A Real-World Approach to Splitting Analytics Workloads Between Databricks and Trino
#
databricks
#
trino
#
analytics
#
bigdata
4
 reactions
Comments
Add Comment
2 min read
BigQuery Sharing: An Underrated Data Exchange Platform You Should Know
Sijohn Mathew
Sijohn Mathew
Sijohn Mathew
Follow
Feb 9
BigQuery Sharing: An Underrated Data Exchange Platform You Should Know
#
googlecloud
#
bigquery
#
gcp
#
bigdata
Comments
1
 comment
4 min read
Why Apache Ozone is the Preferred Object Store for Big Data
Tayfun Yalcinkaya
Tayfun Yalcinkaya
Tayfun Yalcinkaya
Follow
Jan 5
Why Apache Ozone is the Preferred Object Store for Big Data
#
dataengineering
#
bigdata
#
datalakehouse
#
apacheozone
Comments
Add Comment
3 min read
Part 1 | A Scheduler Is More Than Just a “Timer”
Chen Debra
Chen Debra
Chen Debra
Follow
Feb 5
Part 1 | A Scheduler Is More Than Just a “Timer”
#
apachedolphinscheduler
#
opensource
#
programming
#
bigdata
1
 reaction
Comments
Add Comment
4 min read
Exploring Dynamic Return Types in PySpark pandas_udf
dss99911
dss99911
dss99911
Follow
Dec 30 '25
Exploring Dynamic Return Types in PySpark pandas_udf
#
pyspark
#
python
#
dataengineering
#
bigdata
Comments
Add Comment
2 min read
Day 30: From Zero to Production-Ready Spark Data Engineer
Sandeep
Sandeep
Sandeep
Follow
Dec 30 '25
Day 30: From Zero to Production-Ready Spark Data Engineer
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
2 min read
Day 27: Building Exactly-Once Streaming Pipelines with Spark & Delta Lake
Sandeep
Sandeep
Sandeep
Follow
Dec 29 '25
Day 27: Building Exactly-Once Streaming Pipelines with Spark & Delta Lake
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 28: Spark Streaming Performance Tuning
Sandeep
Sandeep
Sandeep
Follow
Dec 29 '25
Day 28: Spark Streaming Performance Tuning
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 29: Building a Production-Grade Real-Time ETL Pipeline with Spark & Delta
Sandeep
Sandeep
Sandeep
Follow
Dec 29 '25
Day 29: Building a Production-Grade Real-Time ETL Pipeline with Spark & Delta
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 26: Spark Streaming Joins
Sandeep
Sandeep
Sandeep
Follow
Dec 26 '25
Day 26: Spark Streaming Joins
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Apache SeaTunnel 2.3.10 Source Code Analysis: Zeta Engine Service Startup
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Dec 26 '25
Apache SeaTunnel 2.3.10 Source Code Analysis: Zeta Engine Service Startup
#
apacheseatunnel
#
programming
#
opensource
#
bigdata
Comments
Add Comment
5 min read
Day 25: Streaming Aggregations in Spark
Sandeep
Sandeep
Sandeep
Follow
Dec 25 '25
Day 25: Streaming Aggregations in Spark
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 24: Spark Structured Streaming
Sandeep
Sandeep
Sandeep
Follow
Dec 24 '25
Day 24: Spark Structured Streaming
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 23: Spark Shuffle Optimization
Sandeep
Sandeep
Sandeep
Follow
Dec 23 '25
Day 23: Spark Shuffle Optimization
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
Day 22: Spark Shuffle Deep Dive
Sandeep
Sandeep
Sandeep
Follow
Dec 22 '25
Day 22: Spark Shuffle Deep Dive
#
dataengineering
#
spark
#
bigdata
#
python
Comments
Add Comment
1 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account