#dataengineering
Posts
SQL CASE Statements: The Order Matters!
Dror Atariah, Jul 18 '25
#sql #database #dataengineering #python
1 comment, 2 min read

Apache Iceberg Table Optimization #9: Managing Large-Scale Optimizations — Parallelism, Checkpointing, and Fail Recovery
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #10: The Endgame — Building an Autonomous Optimization Pipeline for Apache Iceberg
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #2: The Basics of Compaction — Bin Packing Your Data for Efficiency
Alex Merced, Jul 17 '25
#dataengineering #database #datascience #apacheiceberg
3 min read

🧪 Virtual Environments for Data Engineers — 2025 Edition
Aleksei Aleinikov, Jun 14 '25
#dataengineering #python #poetry #venv
1 min read

Apache Iceberg Table Optimization #7: Using Iceberg Metadata Tables to Determine When Compaction Is Needed
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #4: Smarter Data Layout — Sorting and Clustering Iceberg Tables
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
1 reaction, 3 min read

Apache Iceberg Table Optimization #5: Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #3: Optimizing Compaction for Streaming Workloads in Apache Iceberg
Alex Merced, Jul 17 '25
#database #datascience #dataengineering #apacheiceberg
3 min read

Apache Iceberg Table Optimization #1: The Cost of Neglect — How Apache Iceberg Tables Degrade Without Optimization
Alex Merced, Jul 17 '25
#apacheiceberg #dataengineering #datascience
3 min read

Database Design Errors to Avoid & How To Fix Them
Roxana Maria Haidiner, Jul 17 '25
#database #sql #mysql #dataengineering
11 reactions, 1 comment, 5 min read

Cross-Platform Multi-Channel Attribution in Marketing: Balancing Costs and Results Across Devices
Andrey, Jul 17 '25
#analytics #marketing #attribution #dataengineering
1 reaction, 5 min read

2025 Data Warehouse Benchmark: What BigQuery, Snowflake, and Others Don’t Tell You
Sourabh Gupta for Estuary, Jul 17 '25
#dataengineering #cloud #datawarehouse #benchmarking
3 reactions, 2 min read

🛒 Real-Life Data Lakehouse Use Case: Revolutionizing Retail Analytics
Vinicius Fagundes, Jul 16 '25
#datascience #dataengineering #data #ai
2 reactions, 2 min read