Forem

# dataengineering

Posts

Metadata for win — Apache Parquet

Comments
5 min read
Remove unwanted partition data in Azure Synapse (SQL DW)

1
Comments
6 min read
Replacing SaaS ETL with Python dlt: A painless experience for Yummy.eu

2
Comments
3 min read
Simplifying SDMX Data Integration with Python

2
Comments
3 min read
Clustering vs Partitioning your Apache Iceberg Tables

7
Comments
8 min read
From Messy Data to Super Mario Pipeline: My First Adventure in Data Engineering

1
Comments
12 min read
The Data Professions

1
Comments
3 min read
Database generated events: LiveSync’s database connector vs CDC

4
Comments
5 min read
MySQL: Using and Enhancing `DATETIME` and `TIMESTAMP`

1
Comments
3 min read
Working with Parquet files in Java using Carpet

1
Comments 1
6 min read
Analyzing Svenskalag Data using DBT and DuckDB

1
Comments
4 min read
How I've implemented the Medallion architecture using Apache Spark and Apache Hadoop

12
Comments
6 min read
Working with Dates and Times in SQL: Tips and Tricks

Comments
3 min read
FastAPI for Data Applications: From Concept to Creation. Part I

4
Comments
5 min read
Using Elasticsearch Percolate Queries, Netflix Efficiently Improves Reverse Searches

1
Comments
3 min read