Forem

# bigdata

Posts

๐Ÿ‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
6 Essential Data Formats in Cloud Analytics: A Complete Guide with Examples

6 Essential Data Formats in Cloud Analytics: A Complete Guide with Examples

Comments
5 min read
Why Parquet Is Everywhere - And What Makes It Actually Fast?
Cover image for Why Parquet Is Everywhere - And What Makes It Actually Fast?

Why Parquet Is Everywhere - And What Makes It Actually Fast?

2
Comments
3 min read
Final Project Report 2| Apache SeaTunnel Adds Metalake Support

Final Project Report 2| Apache SeaTunnel Adds Metalake Support

Comments
4 min read
Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine

Final Project Report 1: Schema Evolution Support on Apache SeaTunnel Flink Engine

Comments
4 min read
Enabling Continuous Deployment with Amazon Elastic Container Service and Infrastructure as Code
Cover image for Enabling Continuous Deployment with Amazon Elastic Container Service and Infrastructure as Code

Enabling Continuous Deployment with Amazon Elastic Container Service and Infrastructure as Code

Comments
6 min read
From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!
Cover image for From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!

From DataWareHouses to BigData Systems: What and Why - Questions that nobody asks, but you should!

Comments
6 min read
Migration Case: From Azkaban to DolphinScheduler

Migration Case: From Azkaban to DolphinScheduler

Comments
4 min read
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL

5
Comments
7 min read
The data lakehouse evolution

The data lakehouse evolution

Comments
11 min read
How to build real-time user-facing analytics with Kafka + Flink + Doris

How to build real-time user-facing analytics with Kafka + Flink + Doris

4
Comments
9 min read
Apache DolphinScheduler 3.3.2 Released! Major Updates in Performance and Stability

Apache DolphinScheduler 3.3.2 Released! Major Updates in Performance and Stability

Comments
3 min read
๐Ÿ“Š๐Ÿ” OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search
Cover image for ๐Ÿ“Š๐Ÿ” OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search

๐Ÿ“Š๐Ÿ” OpenSearch Dashboards: Optimizing Massive Data Queries (Big Data) with Asynchronous Search

3
Comments
2 min read
๐Ÿ Why Hive Exists - And Why Its Complexity Is Actually Necessary
Cover image for ๐Ÿ Why Hive Exists - And Why Its Complexity Is Actually Necessary

๐Ÿ Why Hive Exists - And Why Its Complexity Is Actually Necessary

2
Comments
3 min read
Building a Universal Lakehouse Catalog: Beyond Iceberg Tables
Cover image for Building a Universal Lakehouse Catalog: Beyond Iceberg Tables

Building a Universal Lakehouse Catalog: Beyond Iceberg Tables

Comments
10 min read
(1) Emerging Data Lakehouse Handbook (2025): Concepts and Design of Data Warehouse Layering

(1) Emerging Data Lakehouse Handbook (2025): Concepts and Design of Data Warehouse Layering

Comments
5 min read
Tutorial: Intro to Apache Iceberg with Apache Polaris and Apache Spark
Cover image for Tutorial: Intro to Apache Iceberg with Apache Polaris and Apache Spark

Tutorial: Intro to Apache Iceberg with Apache Polaris and Apache Spark

Comments
20 min read
Apache Spark เฆธเฆนเฆœเฆญเฆพเฆฌเง‡ เฆœเฆพเฆจเฆฟ

Apache Spark เฆธเฆนเฆœเฆญเฆพเฆฌเง‡ เฆœเฆพเฆจเฆฟ

1
Comments
1 min read
Fueling the Future: How Big Data and AI are Unlocking Green Hydrogen's Potential

Fueling the Future: How Big Data and AI are Unlocking Green Hydrogen's Potential

5
Comments
6 min read
Code Green: How Big Data and AI are Engineering a Sustainable Planet

Code Green: How Big Data and AI are Engineering a Sustainable Planet

Comments
8 min read
From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

From APIs to Aquifers: A Developer's Guide to Smart Water Management Data

Comments
7 min read
Query Anything with SQL: Your Developer's Deep Dive into Apache Drill

Query Anything with SQL: Your Developer's Deep Dive into Apache Drill

Comments
8 min read
Why Your Analytics Queries Are Slow: A Deep Dive into Columnar Databases

Why Your Analytics Queries Are Slow: A Deep Dive into Columnar Databases

1
Comments
6 min read
๐—•๐—ถ๐—ด ๐——๐—ฎ๐˜๐—ฎ ๐—ฃ๐—ถ๐—ฝ๐—ฒ๐—น๐—ถ๐—ป๐—ฒ ๐—–๐—ต๐—ฒ๐—ฎ๐˜๐˜€๐—ต๐—ฒ๐—ฒ๐˜: ๐—”๐—ช๐—ฆ, ๐—”๐˜‡๐˜‚๐—ฟ๐—ฒ, ๐—ฎ๐—ป๐—ฑ ๐—š๐—–๐—ฃ

๐—•๐—ถ๐—ด ๐——๐—ฎ๐˜๐—ฎ ๐—ฃ๐—ถ๐—ฝ๐—ฒ๐—น๐—ถ๐—ป๐—ฒ ๐—–๐—ต๐—ฒ๐—ฎ๐˜๐˜€๐—ต๐—ฒ๐—ฒ๐˜: ๐—”๐—ช๐—ฆ, ๐—”๐˜‡๐˜‚๐—ฟ๐—ฒ, ๐—ฎ๐—ป๐—ฑ ๐—š๐—–๐—ฃ

2
Comments
1 min read
Code Green: How Your Data Skills Can Power Europe's Climate Revolution

Code Green: How Your Data Skills Can Power Europe's Climate Revolution

Comments
6 min read
Data in the Cloud: 6 Common Formats for Data Analytics

Data in the Cloud: 6 Common Formats for Data Analytics

Comments
3 min read
loading...