Forem

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Compression algorithms in Parquet Java

Compression algorithms in Parquet Java

3
Comments 2
7 min read
Top 10 Tools for Efficient Web Scraping in 2025

Top 10 Tools for Efficient Web Scraping in 2025

3
Comments
4 min read
Goodbye Kafka: Build a Low-Cost User Analysis System

Goodbye Kafka: Build a Low-Cost User Analysis System

Comments
5 min read
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀

1
Comments
4 min read
Introduction to Hadoop:)

Introduction to Hadoop:)

6
Comments
10 min read
Big Data Trends That Will Impact Your Business In 2025

Big Data Trends That Will Impact Your Business In 2025

5
Comments
6 min read
The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework

The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework

8
Comments
3 min read
SQL Filtering and Sorting with Real-life Examples

SQL Filtering and Sorting with Real-life Examples

1
Comments
4 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

1
Comments 8
5 min read
Big Data

Big Data

Comments
1 min read
Introduction to Data lakes: The future of big data storage

Introduction to Data lakes: The future of big data storage

5
Comments
2 min read
Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi

Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi

1
Comments
3 min read
The Apache Iceberg™ Small File Problem

The Apache Iceberg™ Small File Problem

9
Comments
3 min read
System Design 09 - Data Partitioning: Dividing to Conquer Big Data

System Design 09 - Data Partitioning: Dividing to Conquer Big Data

Comments
2 min read
Introduction to Messaging Systems with Kafka

Introduction to Messaging Systems with Kafka

Comments
16 min read
Best Practices for Data Security in Big Data Projects

Best Practices for Data Security in Big Data Projects

Comments
6 min read
🚀 Unlock the Power of ORC File Format 📊

🚀 Unlock the Power of ORC File Format 📊

5
Comments
1 min read
SeaTunnel-Powered Data Integration: How 58 Group Handles Over 500 Billion+ Data Points Daily

SeaTunnel-Powered Data Integration: How 58 Group Handles Over 500 Billion+ Data Points Daily

5
Comments 2
5 min read
5 Big Data Use Cases that Retailers Fail to Use for Actionable Insights

5 Big Data Use Cases that Retailers Fail to Use for Actionable Insights

Comments
3 min read
Introduction to Big Data Analysis

Introduction to Big Data Analysis

8
Comments
13 min read
Understanding Star Schema vs. Snowflake Schema

Understanding Star Schema vs. Snowflake Schema

1
Comments
1 min read
Why Scala is the Best Choice for Big Data Applications: Advantages Over Java and Python

Why Scala is the Best Choice for Big Data Applications: Advantages Over Java and Python

Comments
6 min read
Processando 20 milhões de registros em menos de 5 segundos com Apache Hive.

Processando 20 milhões de registros em menos de 5 segundos com Apache Hive.

9
Comments
8 min read
SeaTunnel Community Monthly Report For September

SeaTunnel Community Monthly Report For September

Comments
14 min read
Effizientes Scrapen von JavaScript-Webseiten

Effizientes Scrapen von JavaScript-Webseiten

Comments
3 min read
loading...