#dataengineering
PySpark to Pandas/scikit-learn: A Practical Migration Guide for Data Engineers Learning ML
Nyson Markus
Apr 10
#dataengineering #datascience #machinelearning #python
7 min read
ETL vs ELT: Which One Should You Use and Why?
John Wakaba
Apr 10
#architecture #beginners #data #dataengineering
1 reaction
6 min read
Entity Resolution at Scale: Matching Products Across Amazon, Reddit, and RTINGS
Daniel Rozin
Apr 10
#ai #webdev #dataengineering #tutorial
4 min read
Apache Data Lakehouse Weekly: April 3–9, 2026
Alex Merced
Apr 9
#news #data #dataengineering #opensource
7 min read
AWS Lake Formation: Why Your Data Lake Permissions Are Probably a Mess (And How to Fix That)
Soumyadeep Basu
Apr 9
#dataengineering #awsdatalake #aws
3 min read
ETL VS ELT: WHICH ONE SHOULD YOU USE AND WHY?
Wangeci Ndovu
Apr 10
#analytics #beginners #data #dataengineering
5 min read
Airflow vs Prefect vs Dagster: Picking the Right Orchestrator in 2026
DataStackX
Apr 9
#dataengineering #python #airflow #dagster
6 min read
Advanced SQL Techniques for Data Analytics Every Data Analyst Should Know
Lawrence Murithi
Apr 9
#sql #luxdev #dataengineering
6 min read
Your Customer Table Has Duplicates You Can't See With SQL: How I Built a Cross-Platform Identity Resolution Layer for a Dark Kitchen Data Platform
SARAN TEJA MALLELA
Apr 9
#dataengineering #apachespark #kafka #deltalake
3 reactions
8 min read
How to Bypass the Pandas "Object Tax": Building an 8x Faster CSV Engine in C
NARESH-CN2
Apr 9
#python #performance #dataengineering #datascience
2 min read
PostgreSQL Foreign Data Wrappers: Cross-Database Queries Explained
Philip McClarence
Apr 9
#database #dataengineering #postgres #sql
4 min read
How Google Maps Predicts Traffic in Real Time: Live Data and ETA Explained
Ashish Kumar
Apr 9
#googlemaps #traffic #gps #dataengineering
3 min read
How Gudu SQL Omni Works: Accurate Offline Data Lineage Analysis in VS Code
沈欢
Apr 9
#dataengineering #sql #tooling #vscode
3 min read
ETL vs ELT: Which One Should You Use and Why?
Rose1845
Apr 9
#dataengineering
3 reactions
4 min read
ETL vs ELT in Data Engineering: Key Differences and Use Cases Explained
GeoPITS Global
Apr 9
#dataengineering #etlvseltindataengineering #etlvselt
3 min read