Forem

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Auto-increment columns in Apache Doris

Auto-increment columns in Apache Doris

Comments
11 min read
What to use parquet or CSV?
Cover image for What to use parquet or CSV?

What to use parquet or CSV?

22
Comments
3 min read
Accelerating ETL Processes for Timely Business Intelligence
Cover image for Accelerating ETL Processes for Timely Business Intelligence

Accelerating ETL Processes for Timely Business Intelligence

Comments
4 min read
Are There “Queries over Trillion-Row Tables in Seconds”? Is “N-Times Faster Than ORACLE” an Exaggeration?

Are There “Queries over Trillion-Row Tables in Seconds”? Is “N-Times Faster Than ORACLE” an Exaggeration?

Comments
4 min read
A glimpse into the future of data processing infrastructure.

A glimpse into the future of data processing infrastructure.

Comments
9 min read
Safeguarding Data Quality By Addressing Data Privacy and Security Concerns
Cover image for Safeguarding Data Quality By Addressing Data Privacy and Security Concerns

Safeguarding Data Quality By Addressing Data Privacy and Security Concerns

1
Comments 1
4 min read
Best Practices for Designing an Efficient ETL Pipeline
Cover image for Best Practices for Designing an Efficient ETL Pipeline

Best Practices for Designing an Efficient ETL Pipeline

6
Comments
4 min read
The Role of Big Data Analytics in BFSI: Leveraging Data for Competitive Advantage
Cover image for The Role of Big Data Analytics in BFSI: Leveraging Data for Competitive Advantage

The Role of Big Data Analytics in BFSI: Leveraging Data for Competitive Advantage

Comments
4 min read
LLMs, DevOps, and Big Data Musings
Cover image for LLMs, DevOps, and Big Data Musings

LLMs, DevOps, and Big Data Musings

Comments
3 min read
Understanding and Mitigating Message Loss in Apache Kafka

Understanding and Mitigating Message Loss in Apache Kafka

20
Comments
9 min read
Snowflake 101: A Comprehensive Guide to the Data Cloud
Cover image for Snowflake 101: A Comprehensive Guide to the Data Cloud

Snowflake 101: A Comprehensive Guide to the Data Cloud

2
Comments
4 min read
Blockchain Technology and Data Governance: Enhancing Security and Trust
Cover image for Blockchain Technology and Data Governance: Enhancing Security and Trust

Blockchain Technology and Data Governance: Enhancing Security and Trust

1
Comments 1
4 min read
SQL Pro Tips : industrial AWS Athena SQL using WITH

SQL Pro Tips : industrial AWS Athena SQL using WITH

3
Comments
4 min read
SQL Pro Tips : industrial GCP BigQuery SQL using WITH

SQL Pro Tips : industrial GCP BigQuery SQL using WITH

3
Comments
5 min read
Tools Every Data Scientist Should Know
Cover image for Tools Every Data Scientist Should Know

Tools Every Data Scientist Should Know

Comments
2 min read
AI enthusiasm #3 - AlphaFold2, a game-changer🧬
Cover image for AI enthusiasm #3 - AlphaFold2, a game-changer🧬

AI enthusiasm #3 - AlphaFold2, a game-changer🧬

Comments
2 min read
Redis License Change: A Look at the Competitive Game between OSS and Cloud Computing Giants

Redis License Change: A Look at the Competitive Game between OSS and Cloud Computing Giants

Comments
5 min read
MWAA Plugins and Dependency Survival Guide
Cover image for MWAA Plugins and Dependency Survival Guide

MWAA Plugins and Dependency Survival Guide

6
Comments
3 min read
GenAI Model Optimization: Guide to Fine-Tuning and Quantization
Cover image for GenAI Model Optimization: Guide to Fine-Tuning and Quantization

GenAI Model Optimization: Guide to Fine-Tuning and Quantization

2
Comments
4 min read
What is Surrogate Key in SQL?

What is Surrogate Key in SQL?

Comments
2 min read
SQL Pro Tips : industrial Oracle SQL using WITH

SQL Pro Tips : industrial Oracle SQL using WITH

3
Comments
4 min read
How come there are tens of thousands of tables in a database

How come there are tens of thousands of tables in a database

2
Comments 1
5 min read
Data Streaming Architecture

Data Streaming Architecture

4
Comments
4 min read
Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

Variant in Apache Doris 2.1.0: a new data type 8 times faster than JSON for semi-structured data analysis

Comments
12 min read
Amazon EMR deployment on EKS
Cover image for Amazon EMR deployment on EKS

Amazon EMR deployment on EKS

2
Comments
7 min read
loading...