Forem

# dataengineering

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success
Cover image for Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success

Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success

1
Comments
3 min read
Easily Integrate Databend Test Environment with Testcontainers
Cover image for Easily Integrate Databend Test Environment with Testcontainers

Easily Integrate Databend Test Environment with Testcontainers

4
Comments
4 min read
When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability
Cover image for When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

Comments
5 min read
Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets
Cover image for Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets

Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets

5
Comments
3 min read
Exploring OSM changesets via DuckDB
Cover image for Exploring OSM changesets via DuckDB

Exploring OSM changesets via DuckDB

1
Comments
9 min read
Creating Stripe Test Data in Python
Cover image for Creating Stripe Test Data in Python

Creating Stripe Test Data in Python

2
Comments
4 min read
Data Warehousing Architectures
Cover image for Data Warehousing Architectures

Data Warehousing Architectures

Comments
5 min read
Can AI finally generate best practice code? I think so.
Cover image for Can AI finally generate best practice code? I think so.

Can AI finally generate best practice code? I think so.

2
Comments
6 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
Cover image for Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

1
Comments 8
5 min read
📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data
Cover image for 📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

6
Comments
2 min read
Introduction to Data lakes: The future of big data storage

Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
Cover image for Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

7
Comments
12 min read
The Apache Iceberg™ Small File Problem

The Apache Iceberg™ Small File Problem

13
Comments
3 min read
2025 Guide to Architecting an Iceberg Lakehouse
Cover image for 2025 Guide to Architecting an Iceberg Lakehouse

2025 Guide to Architecting an Iceberg Lakehouse

5
Comments
14 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
Cover image for Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

2
Comments
13 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.