#bigdata
Posts
Compression algorithms in Parquet Java
Jerónimo López
Jan 20
#parquet #java #compression #bigdata
3 reactions
2 comments
7 min read
Top 10 Tools for Efficient Web Scraping in 2025
WISDOMUDO
Jan 16
#webscraping #datascience #automation #bigdata
3 reactions
4 min read
Goodbye Kafka: Build a Low-Cost User Analysis System
ksanaka
Dec 5 '24
#database #kafka #bigdata
5 min read
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀
Madhav
Jan 6
#database #bigdata #dataengineering #analytics
1 reaction
4 min read
Introduction to Hadoop:)
Madhav Ganesan
Nov 24 '24
#hadoop #bigdata #nlp #llm
6 reactions
10 min read
Big Data Trends That Will Impact Your Business In 2025
TechDogs for TechDogs
Dec 24 '24
#bigdata #trends #2025 #technology
5 reactions
6 min read
The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework
Chen Debra
Nov 20 '24
#apachedolphinscheduler #quartz #opensource #bigdata
8 reactions
3 min read
SQL Filtering and Sorting with Real-life Examples
Millie Molotov
Dec 23 '24
#database #sql #mysql #bigdata
1 reaction
4 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
Tim Huang
Dec 18 '24
#postgres #dataengineering #analytics #bigdata
1 reaction
8 comments
5 min read
Big Data
williamxlr
Nov 13 '24
#bigdata #hadoop #spark
1 min read
Introduction to Data lakes: The future of big data storage
Hiswill Thompson
Dec 14 '24
#bigdata #dataengineering
5 reactions
2 min read
Building an Application with Change Data Capture (CDC) Using Debezium, Kafka, and NiFi
Javier Andre Neira Machaca
Dec 14 '24
#cdc #bigdata
1 reaction
3 min read
The Apache Iceberg™ Small File Problem
Danica Fine
Dec 11 '24
#bigdata #apacheiceberg #datalakehouse #dataengineering
9 reactions
3 min read
System Design 09 - Data Partitioning: Dividing to Conquer Big Data
Sarva Bharan
Nov 12 '24
#systemdesign #bigdata #datapartition
2 min read
Introduction to Messaging Systems with Kafka
Yasmine Cherif
Nov 28 '24
#distributedsystems #bigdata #kafka #programming
16 min read
Best Practices for Data Security in Big Data Projects
Aditya Pratap Bhuyan
Oct 24 '24
#bestpractices #bigdata #datasecurity
6 min read
🚀 Unlock the Power of ORC File Format 📊
Pratik Barjatiya
Nov 22 '24
#dataengineering #bigdata #datascience #data
5 reactions
1 min read
SeaTunnel-Powered Data Integration: How 58 Group Handles Over 500 Billion+ Data Points Daily
Apache SeaTunnel
Nov 20 '24
#datascience #apacheseatunnel #opensource #bigdata
5 reactions
2 comments
5 min read
5 Big Data Use Cases that Retailers Fail to Use for Actionable Insights
Dmytro Spilka
Oct 16 '24
#bigdata
3 min read
Introduction to Big Data Analysis
Madhav Ganesan
Nov 17 '24
#bigdata #aws #hadoop #coding
8 reactions
13 min read
Understanding Star Schema vs. Snowflake Schema
Puneet Verma
Nov 16 '24
#dataengineering #datascience #datamodeling #bigdata
1 reaction
1 min read
Why Scala is the Best Choice for Big Data Applications: Advantages Over Java and Python
Aditya Pratap Bhuyan
Oct 11 '24
#scala #java #python #bigdata
6 min read
Processing 20 Million Records in Under 5 Seconds with Apache Hive
Airton Lira junior
Nov 2 '24
#apachehive #hive #bigdata #hadoop
9 reactions
8 min read
SeaTunnel Community Monthly Report For September
Apache SeaTunnel
Oct 9 '24
#developer #apacheseatunnel #opensource #bigdata
14 min read
Efficient Scraping of JavaScript Websites
hanna Fischer
Nov 11 '24
#java #python #javascript #bigdata
3 min read