#dataengineering
Posts
Understanding Data Pipelines: The Backbone of Modern Data Systems
Rithesh Raj, Apr 6
#dataengineering #etl #python #gcp
1 reaction, 3 min read
🚀 Building an ETL Pipeline with Python to Scrape Internship Jobs and Load into Excel
Denzel Kanyeki, Apr 6
#dataengineering #python #programming
1 reaction, 2 comments, 4 min read
Stock Data Extraction Using Apache Kafka
Milcah03, Apr 6
#apachekafka #python #cassandra #dataengineering
6 reactions, 4 min read
Distributed Model Serving Patterns
Emin Mammadov, Apr 5
#distributedsystems #mlops #machinelearning #dataengineering
1 reaction, 4 min read
Unified Data Analytics via a Semantic Layer
John Aitchison, Feb 27
#analytics #dataengineering
3 min read
Thinking about becoming a Data Engineer?
Henry Clapton, Feb 26
#datascience #dataengineering
2 min read
Apache SeaTunnel 2.3.8 JDBC Connector Development Guide
Apache SeaTunnel, Feb 26
#softwaredevelopment #bigdata #dataengineering
12 min read
Slash your cost by 90% with Apache Doris Compute-Storage Decoupled Mode
Apache Doris, Mar 21
#database #opensource #cloudcomputing #dataengineering
9 min read
🔬Public docker images Trivy scans as duckdb datas on Kaggle
adriens, Mar 31
#cybersecurity #dataengineering #jupyter #datascience
3 comments, 1 min read
🐼 Pandas Too Slow? Try These Fast Python Libraries for Data Analysis
Aleksei Aleinikov, Mar 21
#python #dataengineering #pandas #bigdata
1 min read
Architecting High-Performance Data Pipelines with Modern ETL | Spiral Mantra
Karan Singh, Apr 1
#dataengineering #cloud #performance #bigdata
1 min read
How to Optimize SQL Queries for Speed and Efficiency
BiKodes, Mar 31
#sql #sqlserver #database #dataengineering
2 reactions, 2 comments, 5 min read
Study Notes 4.3.2 - Testing and Documenting the Project
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #documentation
3 min read
Study Notes 4.3.1 - Build the First dbt Models
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #datamodeling
4 min read
Study Notes 4.5.2: Visualizing Data with Metabase (Alternative B)
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #metabase
4 min read
Study Notes 4.4.1 | 4.4.2 : Deployment Using dbt Cloud and Locally
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbtcore #dbtcloud
3 min read
Study Notes 4.2.1 | 4.2.2: DBT Project Setup
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #analyticsengineering
2 min read
Study Notes 4.1.2: What is dbt?
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #analyticsengineering
3 min read
Study Notes 4.1.1 - Analytics Engineering Basics
Pizofreude, Feb 25
#dataengineering #dezoomcamp #dbt #analyticsengineering
4 min read
Apache Airflow for Data Engineering: Best Practices and Real-World Examples
Eric Katumo, Mar 30
#dataengineering #data #apacheairflow
6 reactions, 1 comment, 7 min read
Set up Graph Databases in Large-Scale Applications for Complex Data Management
Florian Zeba, Feb 22
#database #dataengineering #datastructures #programming
3 min read
Implementing MLOps within Data Engineering Workflows for Efficient Machine Learning Model Deployment
Florian Zeba, Feb 21
#machinelearning #ai #dataengineering #mlops
3 min read
Stop flattening your JSON just to export it to Excel
Alexandre Manuel, Mar 27
#python #excel #programming #dataengineering
2 min read
Ghost models and spooky manifests in dbt
Aram Panasenco, Feb 21
#dbt #dataengineering #analytics
4 min read
Implementing Advanced Data Governance in Hybrid and Multi-Cloud Environments
Florian Zeba, Feb 21
#data #database #dataengineering #datastructures
4 min read