Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
Eliana Lam
Eliana Lam
Eliana Lam
Follow
Nov 29 '25
Building Streaming Iceberg Tables for Real-Time Logistics Analytics
#
analytics
#
dataengineering
#
architecture
#
opensource
Comments
Add Comment
4 min read
Building a Scalable Community Health Worker Analytics Platform: My Journey with dbt and Snowflake
Amos Augo
Amos Augo
Amos Augo
Follow
Nov 26 '25
Building a Scalable Community Health Worker Analytics Platform: My Journey with dbt and Snowflake
#
dbt
#
snowflake
#
dataengineering
Comments
Add Comment
4 min read
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 31 '25
The Great Table Format Debate: A Deep Dive into Apache Iceberg, Delta Lake, and Apache Hudi
#
database
#
dataengineering
#
iceberg
#
apachehudi
1
 reaction
Comments
Add Comment
18 min read
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
Data Tech Bridge
Data Tech Bridge
Data Tech Bridge
Follow
Dec 30 '25
Amazon Kinesis vs Amazon MSK: The Complete Guide for Stream Processing on AWS
#
architecture
#
aws
#
dataengineering
Comments
Add Comment
29 min read
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
Sandeep
Sandeep
Sandeep
Follow
Dec 9 '25
Day 8: Accelerating Spark Joins - Broadcast, Shuffle Optimization & Skew Handling
#
dataengineering
#
python
#
spark
#
bigdata
Comments
Add Comment
2 min read
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
Jubin Soni
Jubin Soni
Jubin Soni
Follow
Dec 30 '25
Mastering Serverless Data Pipelines: AWS Step Functions Best Practices for 2026
#
aws
#
serverless
#
stepfunctions
#
dataengineering
Comments
Add Comment
5 min read
2025 Year in Review: Apache Iceberg, Polaris, Parquet, and Arrow
Alex Merced
Alex Merced
Alex Merced
Follow
Dec 29 '25
2025 Year in Review: Apache Iceberg, Polaris, Parquet, and Arrow
#
architecture
#
bigdata
#
opensource
#
dataengineering
Comments
Add Comment
6 min read
A Stranger In a New Town: CsvPath metadata fields
David Kershaw
David Kershaw
David Kershaw
Follow
Nov 25 '25
A Stranger In a New Town: CsvPath metadata fields
#
metadata
#
dataengineering
#
csv
#
datascience
Comments
Add Comment
6 min read
Interesting links - November 2025
Robin Moffatt
Robin Moffatt
Robin Moffatt
Follow
Dec 17 '25
Interesting links - November 2025
#
data
#
dataengineering
#
kafka
#
flink
Comments
Add Comment
19 min read
đź’€ RIP Copy-Paste: Google NotebookLM Just Killed Manual Data Entry
Siddhesh Surve
Siddhesh Surve
Siddhesh Surve
Follow
Dec 29 '25
đź’€ RIP Copy-Paste: Google NotebookLM Just Killed Manual Data Entry
#
ai
#
productivity
#
google
#
dataengineering
Comments
Add Comment
3 min read
Unified Data Fabric: Serverless Spark on ROSA Integrating with AWS Glue Catalog
Marco Gonzalez
Marco Gonzalez
Marco Gonzalez
Follow
Dec 29 '25
Unified Data Fabric: Serverless Spark on ROSA Integrating with AWS Glue Catalog
#
serverless
#
kubernetes
#
aws
#
dataengineering
8
 reactions
Comments
1
 comment
39 min read
dupl
Query Filter
Query Filter
Query Filter
Follow
Nov 25 '25
dupl
#
sql
#
dataengineering
#
backend
#
database
Comments
Add Comment
1 min read
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Nov 18–24, 2025)
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 24 '25
Apache Dev List Digest: Iceberg, Polaris, Arrow & Parquet (Nov 18–24, 2025)
#
data
#
dataengineering
#
opensource
#
resources
Comments
Add Comment
5 min read
Building a Realistic Banking Dummy Data Generator with Bad-Data Simulation
Benjamin Ibrulj
Benjamin Ibrulj
Benjamin Ibrulj
Follow
Dec 29 '25
Building a Realistic Banking Dummy Data Generator with Bad-Data Simulation
#
dataengineering
#
python
#
opensource
#
sql
1
 reaction
Comments
Add Comment
1 min read
How to Sync Data from an Oracle Table to Elasticsearch using Kafka Connect
maghsood esmaeili
maghsood esmaeili
maghsood esmaeili
Follow
Dec 16 '25
How to Sync Data from an Oracle Table to Elasticsearch using Kafka Connect
#
database
#
kubernetes
#
dataengineering
#
tutorial
1
 reaction
Comments
1
 comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account