Forem

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!
Cover image for From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!

From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!

Comments
6 min read
Migration Case: From Azkaban to DolphinScheduler

Migration Case: From Azkaban to DolphinScheduler

Comments
4 min read
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

5
Comments
7 min read
🔥 Day 6: Essential PySpark DataFrame Transformations
Cover image for 🔥 Day 6: Essential PySpark DataFrame Transformations

🔥 Day 6: Essential PySpark DataFrame Transformations

Comments
2 min read
The data lakehouse evolution

The data lakehouse evolution

Comments
11 min read
How to build real-time user-facing analytics with Kafka + Flink + Doris

How to build real-time user-facing analytics with Kafka + Flink + Doris

4
Comments
9 min read
Apache DolphinScheduler 3.3.2 Released! Major Updates in Performance and Stability

Apache DolphinScheduler 3.3.2 Released! Major Updates in Performance and Stability

Comments
3 min read
Building a Universal Lakehouse Catalog: Beyond Iceberg Tables
Cover image for Building a Universal Lakehouse Catalog: Beyond Iceberg Tables

Building a Universal Lakehouse Catalog: Beyond Iceberg Tables

Comments
10 min read
(1) Emerging Data Lakehouse Handbook (2025): Concepts and Design of Data Warehouse Layering

(1) Emerging Data Lakehouse Handbook (2025): Concepts and Design of Data Warehouse Layering

Comments
5 min read
Apache Spark সহজভাবে জানি

Apache Spark সহজভাবে জানি

1
Comments
1 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?
Cover image for Why Parquet Is Everywhere - And What Makes It Actually Fast?

Why Parquet Is Everywhere - And What Makes It Actually Fast?

2
Comments
3 min read
Fueling the Future: How Big Data and AI are Unlocking Green Hydrogen's Potential

Fueling the Future: How Big Data and AI are Unlocking Green Hydrogen's Potential

5
Comments
6 min read
Code Green: How Big Data and AI are Engineering a Sustainable Planet

Code Green: How Big Data and AI are Engineering a Sustainable Planet

Comments
8 min read
From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

Comments
7 min read
Query Anything with SQL: Your Developer's Deep Dive into Apache Drill

Query Anything with SQL: Your Developer's Deep Dive into Apache Drill

Comments
8 min read
Why Your Analytics Queries Are Slow: A Deep Dive into Columnar Databases

Why Your Analytics Queries Are Slow: A Deep Dive into Columnar Databases

1
Comments
6 min read
𝗕𝗶𝗴 𝗗𝗮𝘁𝗮 𝗣𝗶𝗽𝗲𝗹𝗶𝗻𝗲 𝗖𝗵𝗲𝗮𝘁𝘀𝗵𝗲𝗲𝘁: 𝗔𝗪𝗦, 𝗔𝘇𝘂𝗿𝗲, 𝗮𝗻𝗱 𝗚𝗖𝗣

𝗕𝗶𝗴 𝗗𝗮𝘁𝗮 𝗣𝗶𝗽𝗲𝗹𝗶𝗻𝗲 𝗖𝗵𝗲𝗮𝘁𝘀𝗵𝗲𝗲𝘁: 𝗔𝗪𝗦, 𝗔𝘇𝘂𝗿𝗲, 𝗮𝗻𝗱 𝗚𝗖𝗣

2
Comments
1 min read
Code Green: How Your Data Skills Can Power Europe's Climate Revolution

Code Green: How Your Data Skills Can Power Europe's Climate Revolution

Comments
6 min read
Data in the Cloud: 6 Common Formats for Data Analytics

Data in the Cloud: 6 Common Formats for Data Analytics

Comments
3 min read
The Symfony/HttpClient Cookbook: 4 Enterprise Patterns You Haven’t Seen
Cover image for The Symfony/HttpClient Cookbook: 4 Enterprise Patterns You Haven’t Seen

The Symfony/HttpClient Cookbook: 4 Enterprise Patterns You Haven’t Seen

5
Comments
11 min read
Code for a Better Planet: Hacking UN SDGs 7-12 with Big Data

Code for a Better Planet: Hacking UN SDGs 7-12 with Big Data

4
Comments
7 min read
📊🔍 OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search
Cover image for 📊🔍 OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search

📊🔍 OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search

6
Comments
2 min read
From Petabytes to Planet: A Developer's Guide to Big Data in Sustainability

From Petabytes to Planet: A Developer's Guide to Big Data in Sustainability

3
Comments
8 min read
Data Formats Every Data Analyst Should Know
Cover image for Data Formats Every Data Analyst Should Know

Data Formats Every Data Analyst Should Know

1
Comments
4 min read
Data-Driven Development: Leveraging Big Data for Smarter Coding

Data-Driven Development: Leveraging Big Data for Smarter Coding

2
Comments
5 min read
loading...