Forem

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success
Cover image for Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success

Why Data Quality Dimensions Are the Secret Ingredient for Data-Driven Success

1
Comments
3 min read
Easily Integrate Databend Test Environment with Testcontainers
Cover image for Easily Integrate Databend Test Environment with Testcontainers

Easily Integrate Databend Test Environment with Testcontainers

4
Comments
4 min read
🤯 #NODES24: a practical path to Cloud-Native Knowledge Graph Automation & AI Agents
Cover image for 🤯 #NODES24: a practical path to Cloud-Native Knowledge Graph Automation & AI Agents

🤯 #NODES24: a practical path to Cloud-Native Knowledge Graph Automation & AI Agents

Comments 8
2 min read
When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability
Cover image for When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability

Comments
5 min read
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

1
Comments
4 min read
Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets
Cover image for Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets

Why Feature Scaling Should Be Done After Splitting Your Dataset into Training and Test Sets

3
Comments
3 min read
Should I add Data Science or Analytics to my skills?
Cover image for Should I add Data Science or Analytics to my skills?

Should I add Data Science or Analytics to my skills?

Comments
1 min read
Innowise is open for internships for Data Engineers and Data Analytics

Innowise is open for internships for Data Engineers and Data Analytics

Comments
1 min read
Exploring OSM changesets via DuckDB
Cover image for Exploring OSM changesets via DuckDB

Exploring OSM changesets via DuckDB

1
Comments
9 min read
Creating Stripe Test Data in Python
Cover image for Creating Stripe Test Data in Python

Creating Stripe Test Data in Python

2
Comments
4 min read
Are AWS Certifications Worth It in 2025?
Cover image for Are AWS Certifications Worth It in 2025?

Are AWS Certifications Worth It in 2025?

3
Comments 1
2 min read
Data ingestion using AWS Services, Part 1
Cover image for Data ingestion using AWS Services, Part 1

Data ingestion using AWS Services, Part 1

Comments
9 min read
Data Warehousing Architectures
Cover image for Data Warehousing Architectures

Data Warehousing Architectures

Comments
5 min read
Can AI finally generate best practice code? I think so.
Cover image for Can AI finally generate best practice code? I think so.

Can AI finally generate best practice code? I think so.

2
Comments
6 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
Cover image for Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

1
Comments 8
5 min read
📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data
Cover image for 📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

📊 AI Dashboard Builder: Create Insightful Dashboards just Droppping your Data

5
Comments
2 min read
Introduction to Data lakes: The future of big data storage

Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack
Cover image for Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

Explorer l'API de 360Learning : de l'agilité de Power Query à la robustesse de la Modern Data Stack

7
Comments
12 min read
Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches

Data Pipeline Filters 101: Choosing Between Static and Dynamic Approaches

Comments
1 min read
The Apache Iceberg™ Small File Problem

The Apache Iceberg™ Small File Problem

13
Comments
3 min read
Ensuring Data Quality: Best Practices and Automation

Ensuring Data Quality: Best Practices and Automation

Comments
6 min read
Data Science Simplified: Tips for Aspiring Data Scientists in 2025

Data Science Simplified: Tips for Aspiring Data Scientists in 2025

1
Comments
4 min read
2025 Guide to Architecting an Iceberg Lakehouse
Cover image for 2025 Guide to Architecting an Iceberg Lakehouse

2025 Guide to Architecting an Iceberg Lakehouse

5
Comments
14 min read
Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!
Cover image for Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

Data Engineer as a Real-Time Algo Trader – Turning Pipelines into Profit (or at Least Trying)!

2
Comments
13 min read
One Off to One Data Platform: Design with Intent [Part 2]
Cover image for One Off to One Data Platform: Design with Intent [Part 2]

One Off to One Data Platform: Design with Intent [Part 2]

1
Comments
5 min read
loading...