#dataengineering
CHW Monthly Activity Aggregation: Turning Visit Logs into Insight
Oliver Samuel · Dec 2 '25 · 5 min read
#analytics #sql #dataengineering #tooling

🔥 Day 2: Understanding Spark Architecture - How Spark Executes Your Code Internally
Sandeep · Dec 2 '25 · 2 min read
#spark #python #dataengineering #bigdata

RAG Isn’t a Modeling Problem. It’s a Data Engineering Problem.
Alex Merced · Jan 6 · 6 min read · 1 reaction
#architecture #dataengineering #llm #rag

Apache Data Lakehouse Weekly: December 30, 2025 – January 5, 2026
Alex Merced · Jan 6 · 4 min read · 1 reaction
#news #database #dataengineering #opensource

Marmot: Data catalog without the complex infrastructure
Charlie Haley · Jan 6 · 3 min read · 1 reaction
#showdev #dataengineering #opensource #go

TDD for dbt: unit testing the way it should be
Niclas Olofsson · Jan 7 · 12 min read · 2 reactions
#codequality #dataengineering #softwareengineering #testing

Building a Medical-Grade Knowledge Graph: Mapping Drug Interactions with Neo4j and LlamaIndex 🩺💻
Beck_Moulton · Jan 6 · 3 min read · 1 comment
#ai #python #dataengineering #neo4j

Schema, COPY, MERGE, and Immutability — A First-Principles Guide for Data Engineers
Shrinivas Vishnupurikar · Jan 5 · 5 min read
#dataengineering

HackerRank 'The Pads' MySQL
Caroline Caillaud · Dec 1 '25 · 3 min read
#dataengineering #beginners #sql #mysql

🔥 Day 5: Introduction to DataFrames - The Most Important Spark API
Sandeep · Dec 5 '25 · 2 min read
#dataengineering #python #spark #bigdata

Comparing Great Expectations and CsvPath Framework
David Kershaw · Nov 30 '25 · 8 min read
#data #dataengineering #csv #database

Financial Transaction Data Reconciler PayPal
Eliana Lam · Nov 30 '25 · 5 min read
#systemdesign #distributedsystems #dataengineering #aws

Streamlit desde cero: cómo crear una app para explorar y visualizar datos (Streamlit from scratch: how to build an app to explore and visualize data)
Mirina-Gonzales · Jan 3 · 4 min read · 3 reactions
#python #streamlit #datascience #dataengineering

Stifel Modern Data Platform
Eliana Lam · Nov 30 '25 · 4 min read
#architecture #aws #dataengineering

Building Pangolin: My Holiday Break, an AI IDE, and a Lakehouse Catalog for the Curious
Alex Merced · Jan 2 · 6 min read
#ai #dataengineering #devjournal