Forem

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
My First Data Pipeline Project Using Airflow, Docker & Postgres (COVID API Edition)
Cover image for My First Data Pipeline Project Using Airflow, Docker & Postgres (COVID API Edition)

My First Data Pipeline Project Using Airflow, Docker & Postgres (COVID API Edition)

6
Comments
2 min read
A Practical Guide to MLOps on AWS: Transforming Raw Data into AI-Ready Datasets with AWS Glue (Phase 02)

A Practical Guide to MLOps on AWS: Transforming Raw Data into AI-Ready Datasets with AWS Glue (Phase 02)

1
Comments 2
8 min read
How Excel is Used in Real-World Data Analysis

How Excel is Used in Real-World Data Analysis

3
Comments
3 min read
A Practical Guide to MLOps on AWS: Streaming Data Ingestion with Kinesis Firehose (Phase 01)

A Practical Guide to MLOps on AWS: Streaming Data Ingestion with Kinesis Firehose (Phase 01)

Comments 2
8 min read
#34 50 Advanced SQL Queries Every Developer Should Know
Cover image for #34 50 Advanced SQL Queries Every Developer Should Know

#34 50 Advanced SQL Queries Every Developer Should Know

2
Comments
7 min read
Setting Up Presto with Apache Superset using Docker 🐳 : Hands-On Guide
Cover image for Setting Up Presto with Apache Superset using Docker 🐳 : Hands-On Guide

Setting Up Presto with Apache Superset using Docker 🐳 : Hands-On Guide

Comments 2
3 min read
Understanding Consistency in PostgreSQL: A Deep Dive into the “C” in ACID

Understanding Consistency in PostgreSQL: A Deep Dive into the “C” in ACID

1
Comments
3 min read
What I Learned Cleaning 1 Million Rows of CSV Data Without Pandas
Cover image for What I Learned Cleaning 1 Million Rows of CSV Data Without Pandas

What I Learned Cleaning 1 Million Rows of CSV Data Without Pandas

5
Comments
2 min read
Building My First Real-Time Dashboard with ClickHouse and Streamlit: TrendLite Breakdown
Cover image for Building My First Real-Time Dashboard with ClickHouse and Streamlit: TrendLite Breakdown

Building My First Real-Time Dashboard with ClickHouse and Streamlit: TrendLite Breakdown

4
Comments
2 min read
From Reddit Trolls to Real-Time Analytics: Building an LLM-Powered Flink Deployment System

From Reddit Trolls to Real-Time Analytics: Building an LLM-Powered Flink Deployment System

3
Comments 1
7 min read
How to Handle Big Data Transformations Without Pandas (and My Favorite Workarounds)
Cover image for How to Handle Big Data Transformations Without Pandas (and My Favorite Workarounds)

How to Handle Big Data Transformations Without Pandas (and My Favorite Workarounds)

5
Comments
3 min read
The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts

The Ultimate Linux Command Cheat Sheet for Data Engineers and Analysts

88
Comments 4
4 min read
Interesting links - April 2025

Interesting links - April 2025

Comments
6 min read
Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL
Cover image for Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL

Building a Stock Data Pipeline with requests, Apache Airflow and PostgreSQL

1
Comments
4 min read
The Underrated Soft Skills That Make Great Data Engineers

The Underrated Soft Skills That Make Great Data Engineers

2
Comments 2
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.