Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
dataengineering
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Linux for Data Engineers: From Terminal to Text Editing
Edmund Eryuba
Edmund Eryuba
Edmund Eryuba
Follow
Jan 25
Linux for Data Engineers: From Terminal to Text Editing
#
linux
#
dataengineering
#
opensource
1
 reaction
Comments
Add Comment
16 min read
LINUX FOR A ROOKIE DATA ENGINEERING STUDENT
Shaban Ibrahim
Shaban Ibrahim
Shaban Ibrahim
Follow
Jan 25
LINUX FOR A ROOKIE DATA ENGINEERING STUDENT
#
linux
#
dataengineering
#
vim
#
datascience
4
 reactions
Comments
Add Comment
5 min read
The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)
Damian
Damian
Damian
Follow
Dec 20 '25
The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)
#
dataengineering
#
rag
#
python
#
opensource
1
 reaction
Comments
Add Comment
3 min read
Introduction to Linux for Data Engineers, Including Practical Use of Vi and Nano with Examples
GeraldM
GeraldM
GeraldM
Follow
Jan 25
Introduction to Linux for Data Engineers, Including Practical Use of Vi and Nano with Examples
#
linux
#
dataengineering
#
vim
#
nano
5
 reactions
Comments
Add Comment
6 min read
Introduction to Linux for Data Engineers, Including Practical Use of Vi and Nano with Examples
MJ-O
MJ-O
MJ-O
Follow
Jan 24
Introduction to Linux for Data Engineers, Including Practical Use of Vi and Nano with Examples
#
linuxforbeginners
#
dataengineering
#
programming
#
ai
2
 reactions
Comments
Add Comment
3 min read
Data Quality at Scale: Validating Scrapes with Pydantic
Lalit Mishra
Lalit Mishra
Lalit Mishra
Follow
Jan 23
Data Quality at Scale: Validating Scrapes with Pydantic
#
automation
#
codequality
#
dataengineering
#
python
3
 reactions
Comments
2
 comments
13 min read
Building a CDC Skyscraper: How SeaTunnel Leverages Debezium Under the Hood
Apache SeaTunnel
Apache SeaTunnel
Apache SeaTunnel
Follow
Dec 19 '25
Building a CDC Skyscraper: How SeaTunnel Leverages Debezium Under the Hood
#
dataengineering
#
database
#
opensource
#
architecture
Comments
Add Comment
3 min read
Medallion Architecture 101: Building Data Pipelines That Don't Fall Apart
Aaron Wiegel
Aaron Wiegel
Aaron Wiegel
Follow
Jan 23
Medallion Architecture 101: Building Data Pipelines That Don't Fall Apart
#
dataengineering
#
database
#
python
#
sql
Comments
Add Comment
11 min read
Amazon S3 Tables Just Got Smarter: Intelligent-Tiering & Native Replication Explained
Sumsuzzaman Chowdhury
Sumsuzzaman Chowdhury
Sumsuzzaman Chowdhury
Follow
for
AWS Community Builders
Jan 1
Amazon S3 Tables Just Got Smarter: Intelligent-Tiering & Native Replication Explained
#
aws
#
dataengineering
#
analytics
#
cloud
Comments
Add Comment
4 min read
My Friday "Sanity Savers" (Software, Data & DevOps edition) 🛠️
Salisu Adeboye
Salisu Adeboye
Salisu Adeboye
Follow
Jan 23
My Friday "Sanity Savers" (Software, Data & DevOps edition) 🛠️
#
devops
#
dataengineering
#
python
#
productivity
Comments
1
 comment
1 min read
Pipelines, ETL, and Warehouses: The DNA of Data Engineering
Vinicius Fagundes
Vinicius Fagundes
Vinicius Fagundes
Follow
Jan 23
Pipelines, ETL, and Warehouses: The DNA of Data Engineering
#
dataengineering
#
datascience
#
beginners
#
career
5
 reactions
Comments
3
 comments
4 min read
Bulletproof Power Query (Part 2): A Smart, Fuzzy-Match Rename Function
Ahmed Essam
Ahmed Essam
Ahmed Essam
Follow
Dec 18 '25
Bulletproof Power Query (Part 2): A Smart, Fuzzy-Match Rename Function
#
powerquery
#
powerbi
#
dataengineering
#
excel
Comments
Add Comment
4 min read
System Architecture Analysis: The Data Pipeline Issues of TraderKnows
Bittam
Bittam
Bittam
Follow
Dec 19 '25
System Architecture Analysis: The Data Pipeline Issues of TraderKnows
#
dataengineering
#
fintech
#
traderknows
#
webscraping
Comments
Add Comment
2 min read
Building an AI-Powered Customer Churn Prediction Pipeline on AWS (Step-by-Step)
WanjohiChristopher
WanjohiChristopher
WanjohiChristopher
Follow
for
AWS Community Builders
Jan 1
Building an AI-Powered Customer Churn Prediction Pipeline on AWS (Step-by-Step)
#
aws
#
machinelearning
#
dataengineering
#
python
2
 reactions
Comments
Add Comment
5 min read
Beyond Tagging: A Blueprint for Real-Time Cost Attribution in Data Platforms
Mahendran
Mahendran
Mahendran
Follow
Dec 22 '25
Beyond Tagging: A Blueprint for Real-Time Cost Attribution in Data Platforms
#
dataengineering
#
finops
#
bigdata
#
costoptimization
Comments
Add Comment
9 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account