Forem

# bigdata

Posts

Is Storage-Computing Separation Really Necessary? From the Architectural Debate to the Practical Analysis of Doris

1
Comments
4 min read
Building an Efficient and Cost-Effective Business Data Analytics System with Databend Cloud

Comments
7 min read
🐼 Pandas Too Slow? Try These Fast Python Libraries for Data Analysis

Comments
1 min read
build-my-own-datalake: Starting from PoC

Comments
5 min read
The two versions of Parquet

2
Comments
5 min read
How to Load Datasets Efficiently in Pandas: A Complete Guide

8
Comments 2
4 min read
Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform

2
Comments
3 min read
Mastering SQL for Data Engineering: Advanced Queries, Optimization, and Data Modeling Best Practices

Comments
4 min read
MapReduce Simplified: Understand Distributed Processing with the Same Logic as SQL

2
Comments
4 min read
How to Calculate the Return on Investment for Data Analytics

1
Comments
5 min read
5 Game-Changing Habits to Master Your Data Science Journey

6
Comments
4 min read
Object Storage as Primary Storage: The MinIO Story

3
Comments
7 min read
Rethinking distributed systems: Composability, scalability

Comments
5 min read
Compression algorithms in Parquet Java

3
Comments 2
7 min read
Top 10 Web Scraping Tools in 2025 (Free & Paid Options)

9
Comments 4
5 min read