Forem

# dataengineering

Posts

Data Science vs Business Analytics
1 min read
Big Data Fundamentals: data pipeline example
6 min read
Big Data Fundamentals: data pipeline
6 min read
What Is Change Data Capture (CDC) and How It Works on Google Cloud
2 min read
💾 Parquet or Avro? CSV or JSON?
1 min read
Reading CSVs with varying column counts that pandas cannot read using DuckDB
3 min read
Working with Apache to automate collection of Weather data for Kenya’s major Agricultural Areas
5 min read
Big Data Fundamentals: data warehouse example
5 min read
Big Data Fundamentals: data warehouse
6 min read
Big Data Fundamentals: data lake with python
6 min read
Big Data Fundamentals: data lake tutorial
6 min read
Big Data Fundamentals: data lake project
6 min read
Big Data Fundamentals: data lake example
6 min read
What Is Data Lineage? Learn How to Trace Your Data’s Journey
2 min read
Building a Data Career: The Skills That Truly Matter
5 min read
You Can't Trust COUNT and SUM: Scalable Data Validation with Merkle Trees
8 min read
Unable to emit metadata to DataHub GMS with Airflow - a solution
4 min read
Snowflake RBAC 101 – Episode 2: Role Hierarchies & Least Privilege
1 min read
Lightweight ETL with AWS Glue Python Shell, DuckDB, and PyIceberg
7 min read
PyIceberg on AWS Lambda: Comparing GlueCatalog and REST Catalog Access Methods
3 min read
Big Data Fundamentals: data lake
6 min read
Big Data Fundamentals: delta lake with python
6 min read
The Rise of Real-Time Data: Why Batch Might Be Fading
3 min read
📚 A Complete Guide to Data Science Courses: How to Choose, What to Learn, and Where to Begin
5 min read
Engineering with SOLID, DRY, KISS, YAGNI and GRASP
16 min read