Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
Forem
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
Yann Barrault
Yann Barrault
Yann Barrault
Follow
for
Onepoint
Dec 14 '24
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
#
adventoftech2024
#
dataengineering
#
airbyte
#
dbt
7
reactions
Comments
Add Comment
12 min read
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches
Muhammad Mahdi Ramadhan
Muhammad Mahdi Ramadhan
Muhammad Mahdi Ramadhan
Follow
Nov 9 '24
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches
#
dataengineering
#
etl
#
dataanalyst
#
database
Comments
Add Comment
1 min read
The Apache Iceberg™ Small File Problem
Danica Fine
Danica Fine
Danica Fine
Follow
Dec 11 '24
The Apache Iceberg™ Small File Problem
#
bigdata
#
apacheiceberg
#
datalakehouse
#
dataengineering
9
reactions
Comments
Add Comment
3 min read
Ensuring Data Quality: Best Practices and Automation
BuzzGK
BuzzGK
BuzzGK
Follow
Nov 7 '24
Ensuring Data Quality: Best Practices and Automation
#
data
#
dataengineering
Comments
Add Comment
6 min read
Data Science Simplified: Tips for Aspiring Data Scientists in 2025
Vikas76
Vikas76
Vikas76
Follow
Nov 10 '24
Data Science Simplified: Tips for Aspiring Data Scientists in 2025
#
datascience
#
dataengineering
#
data
#
roadmap
1
reaction
Comments
Add Comment
4 min read
2025 Guide to Architecting an Iceberg Lakehouse
Alex Merced
Alex Merced
Alex Merced
Follow
Dec 9 '24
2025 Guide to Architecting an Iceberg Lakehouse
#
apacheiceberg
#
database
#
dataengineering
#
datascience
5
reactions
Comments
Add Comment
14 min read
Dremio, Apache Iceberg and their role in AI-Ready Data
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 5 '24
Dremio, Apache Iceberg and their role in AI-Ready Data
#
database
#
dataengineering
#
datascience
#
dataanalytics
Comments
Add Comment
7 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
SNEHASISH DUTTA
SNEHASISH DUTTA
SNEHASISH DUTTA
Follow
Dec 9 '24
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
#
sql
#
python
#
dataengineering
#
eventdriven
2
reactions
Comments
Add Comment
13 min read
One Off to One Data Platform: Design with Intent [Part 2]
Lulu Cheng
Lulu Cheng
Lulu Cheng
Follow
for
jarrid.xyz
Dec 8 '24
One Off to One Data Platform: Design with Intent [Part 2]
#
data
#
architecture
#
dataengineering
#
startup
1
reaction
Comments
Add Comment
5 min read
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem
M. Abdullah Bin Aftab
M. Abdullah Bin Aftab
M. Abdullah Bin Aftab
Follow
Dec 6 '24
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem
#
aws
#
dataengineering
#
data
#
cloudcomputing
Comments
Add Comment
2 min read
Choosing the right, real-time, Postgres CDC platform
Eric Goldman
Eric Goldman
Eric Goldman
Follow
for
Sequin
Dec 6 '24
Choosing the right, real-time, Postgres CDC platform
#
postgres
#
database
#
eventdriven
#
dataengineering
Comments
Add Comment
8 min read
ChatGPT Launches Pro: What's it Mean for Data Professionals?
Anthony Clemons
Anthony Clemons
Anthony Clemons
Follow
Dec 5 '24
ChatGPT Launches Pro: What's it Mean for Data Professionals?
#
openai
#
chatgpt
#
datascience
#
dataengineering
2
reactions
Comments
Add Comment
4 min read
Introduction to Apache Kafka
Hiswill Thompson
Hiswill Thompson
Hiswill Thompson
Follow
Dec 5 '24
Introduction to Apache Kafka
#
datapipeline
#
dataengineering
#
realtimedat
3
reactions
Comments
1
comment
3 min read
Mastering Workflow Automation with Apache Airflow for Data Engineering
Aditya Pratap Bhuyan
Aditya Pratap Bhuyan
Aditya Pratap Bhuyan
Follow
Oct 31 '24
Mastering Workflow Automation with Apache Airflow for Data Engineering
#
dataengineering
#
apacheairflow
Comments
Add Comment
6 min read
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions
kaito
kaito
kaito
Follow
Dec 3 '24
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions
#
twitter
#
datascience
#
dataengineering
#
development
Comments
Add Comment
3 min read
Seaborn Cheat Sheet
Arum Puri
Arum Puri
Arum Puri
Follow
Nov 30 '24
Seaborn Cheat Sheet
#
seaborn
#
datascience
#
cheatsheet
#
dataengineering
1
reaction
Comments
Add Comment
2 min read
Jupyter Notebooks in Docker
Hassan Aftab
Hassan Aftab
Hassan Aftab
Follow
Nov 29 '24
Jupyter Notebooks in Docker
#
datascience
#
docker
#
dataengineering
#
programming
9
reactions
Comments
1
comment
3 min read
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines
SANKET PATIL
SANKET PATIL
SANKET PATIL
Follow
Nov 29 '24
🚀 Beyond Data Ingestion: Advanced Strategies for Optimizing API Data Pipelines
#
dataengineering
#
azuredatafactory
#
apipagination
#
devops
4
reactions
Comments
1
comment
3 min read
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.
Danwycliff Ndwiga
Danwycliff Ndwiga
Danwycliff Ndwiga
Follow
Oct 30 '24
SQL "SELECT INTO" vs "INSERT INTO SELECT" statements.
#
sql
#
database
#
data
#
dataengineering
Comments
Add Comment
1 min read
ACID Properties in Databases: What Happens Without Them?
Meqdad Darwish
Meqdad Darwish
Meqdad Darwish
Follow
Nov 27 '24
ACID Properties in Databases: What Happens Without Them?
#
database
#
data
#
dataengineering
#
sql
5
reactions
Comments
Add Comment
6 min read
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs
adriens
adriens
adriens
Follow
Nov 25 '24
🕵️ OSINT: link company acronyms to Standard Occupation Classification w. Open Source LLMs
#
ai
#
datascience
#
database
#
dataengineering
1
reaction
Comments
8
comments
6 min read
Data Architecture Best Practices
DQOps
DQOps
DQOps
Follow
Nov 23 '24
Data Architecture Best Practices
#
data
#
dataengineering
#
dataquality
#
datascience
1
reaction
Comments
Add Comment
6 min read
My Journey into Data AI and Machine Learning
Lusanda Ndlovu
Lusanda Ndlovu
Lusanda Ndlovu
Follow
Oct 20 '24
My Journey into Data AI and Machine Learning
#
softwaredevelopment
#
ai
#
machinelearning
#
dataengineering
Comments
Add Comment
1 min read
🚀 Unlock the Power of ORC File Format 📊
Pratik Barjatiya
Pratik Barjatiya
Pratik Barjatiya
Follow
Nov 22 '24
🚀 Unlock the Power of ORC File Format 📊
#
dataengineering
#
bigdata
#
datascience
#
data
5
reactions
Comments
Add Comment
1 min read
Designing robust and scalable relational databases: A series of best practices.
Pedro H Goncalves
Pedro H Goncalves
Pedro H Goncalves
Follow
Nov 19 '24
Designing robust and scalable relational databases: A series of best practices.
#
database
#
advanced
#
optimization
#
dataengineering
14
reactions
Comments
5
comments
17 min read
loading...
We're a blogging-forward open source social network where we learn from one another
Log in
Create account