Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Overview: Change Data Capture (CDC)
longtk26
longtk26
longtk26
Follow
Apr 17
Overview: Change Data Capture (CDC)
#
database
#
dataengineering
#
distributedsystems
#
systemdesign
Comments
Add Comment
4 min read
The NDC Revolution and What It Means for Data Engineers in Travel Tech
Martin Tuncaydin
Martin Tuncaydin
Martin Tuncaydin
Follow
Apr 17
The NDC Revolution and What It Means for Data Engineers in Travel Tech
#
ndc
#
dataengineering
#
airlinetechnology
#
traveltech
Comments
Add Comment
6 min read
The Missing Layer in Your Data Stack Why Semantic Intelligence Matters More Than Another BI Tool
Hello Arisyn
Hello Arisyn
Hello Arisyn
Follow
Apr 17
The Missing Layer in Your Data Stack Why Semantic Intelligence Matters More Than Another BI Tool
#
dataengineering
#
semanticlayer
#
datagovernance
#
naturallanguagequery
Comments
Add Comment
4 min read
Data Pipelines Explained Simply (and How to Build Them with Python)
Anthony Gicheru
Anthony Gicheru
Anthony Gicheru
Follow
Apr 17
Data Pipelines Explained Simply (and How to Build Them with Python)
#
etl
#
python
#
datapipeline
#
dataengineering
1
reaction
Comments
Add Comment
2 min read
Building NewHomie property analytics tool — Part 1
Meng Lin
Meng Lin
Meng Lin
Follow
Apr 16
Building NewHomie property analytics tool — Part 1
#
dataengineering
#
aws
#
softwareengineering
#
distributedsystems
Comments
Add Comment
8 min read
Stop Rewriting the Same LLM Boilerplate: Batch-Process DataFrames in 3 Lines
ptimizeroracle
ptimizeroracle
ptimizeroracle
Follow
Apr 16
Stop Rewriting the Same LLM Boilerplate: Batch-Process DataFrames in 3 Lines
#
python
#
opensource
#
dataengineering
#
llm
Comments
Add Comment
4 min read
How to Add a Data Quality Gate to Your Airflow Pipeline in 5 Minutes
Vignesh
Vignesh
Vignesh
Follow
Apr 17
How to Add a Data Quality Gate to Your Airflow Pipeline in 5 Minutes
#
dataengineering
#
airflow
#
etl
#
programming
Comments
Add Comment
4 min read
Processing High Frequency Solar Data Without HPC: Real Constraints and Design Decisions in MackSun
Wilians Conde
Wilians Conde
Wilians Conde
Follow
Apr 16
Processing High Frequency Solar Data Without HPC: Real Constraints and Design Decisions in MackSun
#
dataengineering
#
mongodb
#
systemdesign
#
bigdata
Comments
Add Comment
3 min read
What Spark Interviews Actually Test (Based on 189 Real Interview Reports)
DataDriven
DataDriven
DataDriven
Follow
Apr 16
What Spark Interviews Actually Test (Based on 189 Real Interview Reports)
#
dataengineering
#
interview
#
career
#
programming
Comments
Add Comment
7 min read
Why Semantic Layers Need Distributional Validation, Not Just Schema Validation
Anthony Johnson II
Anthony Johnson II
Anthony Johnson II
Follow
Apr 16
Why Semantic Layers Need Distributional Validation, Not Just Schema Validation
#
dataquality
#
semanticlayer
#
dataengineering
#
opensource
Comments
Add Comment
9 min read
AWS Lake Formation: TBAC vs NBAC — The Permission Model Decision That Will Define Your Data Lake
Soumyadeep Basu
Soumyadeep Basu
Soumyadeep Basu
Follow
Apr 16
AWS Lake Formation: TBAC vs NBAC — The Permission Model Decision That Will Define Your Data Lake
#
aws
#
dataengineering
#
terraform
#
cloud
Comments
Add Comment
4 min read
DuckDB in the Wild: What 6 Minutes of Benchmarking Across 4 Machines Taught Me About Real-World Performance
XIANGWEIXIAO
XIANGWEIXIAO
XIANGWEIXIAO
Follow
Apr 16
DuckDB in the Wild: What 6 Minutes of Benchmarking Across 4 Machines Taught Me About Real-World Performance
#
database
#
dataengineering
#
performance
#
sql
Comments
Add Comment
5 min read
ETL vs ELT: Understanding the Two Pillars of Modern Data Engineering
jim kinyua
jim kinyua
jim kinyua
Follow
Apr 16
ETL vs ELT: Understanding the Two Pillars of Modern Data Engineering
#
dataengineering
#
etl
#
database
#
beginners
Comments
Add Comment
16 min read
Financial Data Integration: A Practical Guide
Andrew Tan
Andrew Tan
Andrew Tan
Follow
Apr 16
Financial Data Integration: A Practical Guide
#
architecture
#
dataengineering
#
systemdesign
#
tutorial
Comments
Add Comment
7 min read
Why Real-Time Data Integration Matters for Modern Applications
Andrew Tan
Andrew Tan
Andrew Tan
Follow
Apr 16
Why Real-Time Data Integration Matters for Modern Applications
#
data
#
dataengineering
#
systemdesign
#
devops
Comments
Add Comment
5 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account