Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Exploring OSM changesets via DuckDB
Pavel P.
Pavel P.
Pavel P.
Follow
Dec 28 '24
Exploring OSM changesets via DuckDB
#
osm
#
dataengineering
#
duckdb
#
database
1
reaction
Comments
Add Comment
9 min read
Creating Stripe Test Data in Python
Quinton
Quinton
Quinton
Follow
for
Airbyte
Dec 26 '24
Creating Stripe Test Data in Python
#
stripe
#
python
#
dataengineering
#
airbyte
2
reactions
Comments
Add Comment
4 min read
Setting up memory for Flink - Configuration
kination
kination
kination
Follow
Nov 22 '24
Setting up memory for Flink - Configuration
#
dataengineering
#
flink
#
memory
#
tuning
Comments
Add Comment
3 min read
Are AWS Certifications Worth It in 2025?
SkillBoostTrainer
SkillBoostTrainer
SkillBoostTrainer
Follow
Dec 25 '24
Are AWS Certifications Worth It in 2025?
#
aws
#
awscertification
#
dataengineering
#
cloudservices
3
reactions
Comments
Add Comment
2 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
Tim Huang
Tim Huang
Tim Huang
Follow
Dec 18 '24
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
#
postgres
#
dataengineering
#
analytics
#
bigdata
1
reaction
Comments
8
comments
5 min read
Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?
Hana Sato
Hana Sato
Hana Sato
Follow
Nov 13 '24
Talend vs. Apache Kafka: Which Data Tool Drives Better Business Insights?
#
datascience
#
devops
#
dataengineering
#
datastructures
Comments
Add Comment
6 min read
📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data
Pablo
Pablo
Pablo
Follow
Dec 15 '24
📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data
#
dataengineering
#
datascience
#
llm
#
python
2
reactions
Comments
Add Comment
2 min read
Introduction to Data lakes: The future of big data storage
Hiswill Thompson
Hiswill Thompson
Hiswill Thompson
Follow
Dec 14 '24
Introduction to Data lakes: The future of big data storage
#
bigdata
#
dataengineering
5
reactions
Comments
Add Comment
2 min read
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
Yann Barrault
Yann Barrault
Yann Barrault
Follow
for
Onepoint
Dec 14 '24
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
#
adventoftech2024
#
dataengineering
#
airbyte
#
dbt
7
reactions
Comments
Add Comment
12 min read
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches
Muhammad Mahdi Ramadhan
Muhammad Mahdi Ramadhan
Muhammad Mahdi Ramadhan
Follow
Nov 9 '24
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches
#
dataengineering
#
etl
#
dataanalyst
#
database
Comments
Add Comment
1 min read
The Apache Icebergâ„¢ Small File Problem
Danica Fine
Danica Fine
Danica Fine
Follow
Dec 11 '24
The Apache Icebergâ„¢ Small File Problem
#
bigdata
#
apacheiceberg
#
datalakehouse
#
dataengineering
9
reactions
Comments
Add Comment
3 min read
Ensuring Data Quality: Best Practices and Automation
BuzzGK
BuzzGK
BuzzGK
Follow
Nov 7 '24
Ensuring Data Quality: Best Practices and Automation
#
data
#
dataengineering
Comments
Add Comment
6 min read
Data Science Simplified: Tips for Aspiring Data Scientists in 2025
Vikas76
Vikas76
Vikas76
Follow
Nov 10 '24
Data Science Simplified: Tips for Aspiring Data Scientists in 2025
#
datascience
#
dataengineering
#
data
#
roadmap
1
reaction
Comments
Add Comment
4 min read
2025 Guide to Architecting an Iceberg Lakehouse
Alex Merced
Alex Merced
Alex Merced
Follow
Dec 9 '24
2025 Guide to Architecting an Iceberg Lakehouse
#
apacheiceberg
#
database
#
dataengineering
#
datascience
5
reactions
Comments
Add Comment
14 min read
Dremio, Apache Iceberg and their role in AI-Ready Data
Alex Merced
Alex Merced
Alex Merced
Follow
Nov 5 '24
Dremio, Apache Iceberg and their role in AI-Ready Data
#
database
#
dataengineering
#
datascience
#
dataanalytics
Comments
Add Comment
7 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
SNEHASISH DUTTA
SNEHASISH DUTTA
SNEHASISH DUTTA
Follow
Dec 9 '24
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
#
sql
#
python
#
dataengineering
#
eventdriven
2
reactions
Comments
Add Comment
13 min read
One Off to One Data Platform: Design with Intent [Part 2]
Lulu Cheng
Lulu Cheng
Lulu Cheng
Follow
for
jarrid.xyz
Dec 8 '24
One Off to One Data Platform: Design with Intent [Part 2]
#
data
#
architecture
#
dataengineering
#
startup
1
reaction
Comments
Add Comment
5 min read
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem
M. Abdullah Bin Aftab
M. Abdullah Bin Aftab
M. Abdullah Bin Aftab
Follow
Dec 6 '24
Case Study: Creating an ETL Data Pipeline using AWS Services - Real-World Problem
#
aws
#
dataengineering
#
data
#
cloudcomputing
Comments
Add Comment
2 min read
Choosing the right, real-time, Postgres CDC platform
Eric Goldman
Eric Goldman
Eric Goldman
Follow
for
Sequin
Dec 6 '24
Choosing the right, real-time, Postgres CDC platform
#
postgres
#
database
#
eventdriven
#
dataengineering
Comments
Add Comment
8 min read
ChatGPT Launches Pro: What's it Mean for Data Professionals?
Anthony Clemons
Anthony Clemons
Anthony Clemons
Follow
Dec 5 '24
ChatGPT Launches Pro: What's it Mean for Data Professionals?
#
openai
#
chatgpt
#
datascience
#
dataengineering
2
reactions
Comments
Add Comment
4 min read
Introduction to Apache Kafka
Hiswill Thompson
Hiswill Thompson
Hiswill Thompson
Follow
Dec 5 '24
Introduction to Apache Kafka
#
datapipeline
#
dataengineering
#
realtimedat
3
reactions
Comments
1
comment
3 min read
Mastering Workflow Automation with Apache Airflow for Data Engineering
Aditya Pratap Bhuyan
Aditya Pratap Bhuyan
Aditya Pratap Bhuyan
Follow
Oct 31 '24
Mastering Workflow Automation with Apache Airflow for Data Engineering
#
dataengineering
#
apacheairflow
Comments
Add Comment
6 min read
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions
kaito
kaito
kaito
Follow
Dec 3 '24
Mastering Twitter Data Collection: A Comprehensive Guide to Efficient Scraping Solutions
#
twitter
#
datascience
#
dataengineering
#
development
Comments
Add Comment
3 min read
Seaborn Cheat Sheet
Arum Puri
Arum Puri
Arum Puri
Follow
Nov 30 '24
Seaborn Cheat Sheet
#
seaborn
#
datascience
#
cheatsheet
#
dataengineering
1
reaction
Comments
Add Comment
2 min read
Jupyter Notebooks in Docker
Hassan Aftab
Hassan Aftab
Hassan Aftab
Follow
Nov 29 '24
Jupyter Notebooks in Docker
#
datascience
#
docker
#
dataengineering
#
programming
9
reactions
Comments
1
comment
3 min read
loading...
We're a blogging-forward open source social network where we learn from one another
Log in
Create account