Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Log in
Create account
Forem
Close
#
bigdata
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
A Comprehensive Comparison of JuiceFS and HDFS for Cloud-Based Big Data Storage
tonybarber2
tonybarber2
tonybarber2
Follow
Apr 7 '23
A Comprehensive Comparison of JuiceFS and HDFS for Cloud-Based Big Data Storage
#
bigdata
#
opensource
#
cloud
2
reactions
Comments
Add Comment
11 min read
The Secret to Rapid Scaling: How Scraping Helped These Startups Go From Zero to $1.2+ Trillion
Tomas Laurinavicius
Tomas Laurinavicius
Tomas Laurinavicius
Follow
Mar 28 '23
The Secret to Rapid Scaling: How Scraping Helped These Startups Go From Zero to $1.2+ Trillion
#
startup
#
bigdata
#
scraping
#
datascience
7
reactions
Comments
1
comment
6 min read
Mastering Large-Scale Data Processing: Building a Data Pipeline with ApacheAGE for Efficient Ingestion, Processing, and Analysis
Humza Tareen
Humza Tareen
Humza Tareen
Follow
Mar 25 '23
Mastering Large-Scale Data Processing: Building a Data Pipeline with ApacheAGE for Efficient Ingestion, Processing, and Analysis
#
apacheage
#
postgres
#
datascience
#
bigdata
2
reactions
Comments
Add Comment
2 min read
How we mastered dbt: A true story
Olga Braginskaya
Olga Braginskaya
Olga Braginskaya
Follow
Mar 22 '23
How we mastered dbt: A true story
#
bigdata
#
dataengineering
#
dbt
#
tutorial
7
reactions
Comments
Add Comment
14 min read
Exploration of Spark Executor Memory
Lorenzo Lou
Lorenzo Lou
Lorenzo Lou
Follow
Mar 21 '23
Exploration of Spark Executor Memory
#
spark
#
programming
#
bigdata
2
reactions
Comments
Add Comment
9 min read
GETTING STARTED WITH SENTIMENT ANALYSIS.
BRENDA ATIENO ODHIAMBO
BRENDA ATIENO ODHIAMBO
BRENDA ATIENO ODHIAMBO
Follow
Mar 17 '23
GETTING STARTED WITH SENTIMENT ANALYSIS.
#
python
#
datascience
#
dataanalysis
#
bigdata
2
reactions
Comments
Add Comment
4 min read
Lightweight HTTP API for Big Data on S3
Paulius
Paulius
Paulius
Follow
for
Exacaster
Mar 15 '23
Lightweight HTTP API for Big Data on S3
#
deltalake
#
bigdata
#
opensource
#
s3
3
reactions
Comments
Add Comment
3 min read
How to cope with high-concurrency account query?
jbx1279
jbx1279
jbx1279
Follow
Mar 12 '23
How to cope with high-concurrency account query?
#
database
#
bigdata
#
performance
#
programming
Comments
Add Comment
6 min read
Don't Break the Bank on SQL Queries: BigQuery On-Demand vs Flat-Rate prices. Which Saves You More? 💰😎
Olga R
Olga R
Olga R
Follow
Mar 12 '23
Don't Break the Bank on SQL Queries: BigQuery On-Demand vs Flat-Rate prices. Which Saves You More? 💰😎
#
productivity
#
database
#
sql
#
bigdata
5
reactions
Comments
3
comments
5 min read
Read before-The Ultimate Guide to AWS IoT Core: What it is, How it helps, and Real-World use Cases. Mini-Project-Intro
Augusto Valdivia
Augusto Valdivia
Augusto Valdivia
Follow
for
AWS Community Builders
Mar 12 '23
Read before-The Ultimate Guide to AWS IoT Core: What it is, How it helps, and Real-World use Cases. Mini-Project-Intro
#
awsiotcore
#
terraform
#
iot
#
bigdata
10
reactions
Comments
Add Comment
3 min read
"Features of Data Lake Federated Analysis"_Apache Doris Summit 2022
31:03
SelectDB
SelectDB
SelectDB
Follow
Feb 7 '23
"Features of Data Lake Federated Analysis"_Apache Doris Summit 2022
#
database
#
bigdata
#
datalake
#
opensource
2
reactions
Comments
Add Comment
1 min read
Tencent Data Engineer: Why We Go from ClickHouse to Apache Doris?
Apache Doris
Apache Doris
Apache Doris
Follow
Mar 7 '23
Tencent Data Engineer: Why We Go from ClickHouse to Apache Doris?
#
database
#
datascience
#
bigdata
1
reaction
Comments
Add Comment
11 min read
ClickHouse is fast, esProc SPL is faster
jbx1279
jbx1279
jbx1279
Follow
Feb 27 '23
ClickHouse is fast, esProc SPL is faster
#
bigdata
#
database
#
sql
#
programming
1
reaction
Comments
Add Comment
10 min read
EXPLORATORY DATA ANALYSIS ULTIMATE GUIDE.
BRENDA ATIENO ODHIAMBO
BRENDA ATIENO ODHIAMBO
BRENDA ATIENO ODHIAMBO
Follow
Feb 24 '23
EXPLORATORY DATA ANALYSIS ULTIMATE GUIDE.
#
python
#
datascience
#
dataanalysis
#
bigdata
1
reaction
Comments
Add Comment
3 min read
How to use docker to compile Apache Doris
Lemon
Lemon
Lemon
Follow
Feb 24 '23
How to use docker to compile Apache Doris
#
apachedoris
#
doris
#
olap
#
bigdata
3
reactions
Comments
Add Comment
3 min read
Apache Doris be common problem positioning and processing
Lemon
Lemon
Lemon
Follow
Feb 24 '23
Apache Doris be common problem positioning and processing
#
apachedoris
#
doris
#
olap
#
bigdata
2
reactions
Comments
Add Comment
3 min read
Amazon Redshift: What, Why, and How
Vikas Solegaonkar
Vikas Solegaonkar
Vikas Solegaonkar
Follow
for
AWS Community Builders
Feb 20 '23
Amazon Redshift: What, Why, and How
#
redshift
#
aws
#
bigdata
#
database
2
reactions
Comments
1
comment
5 min read
Hadoop/Spark is too heavy, esProc SPL is light
jbx1279
jbx1279
jbx1279
Follow
Feb 6 '23
Hadoop/Spark is too heavy, esProc SPL is light
#
bigdata
#
database
#
programming
Comments
Add Comment
12 min read
The Tale of the Mexican Fisherman
Alex Hyett
Alex Hyett
Alex Hyett
Follow
Jan 29 '23
The Tale of the Mexican Fisherman
#
discuss
#
coding
#
bigdata
Comments
Add Comment
4 min read
How working/install Spark with Notebooks?
Lucas M. RÃos
Lucas M. RÃos
Lucas M. RÃos
Follow
Jan 16 '23
How working/install Spark with Notebooks?
#
python
#
datascience
#
bigdata
#
cloud
3
reactions
Comments
Add Comment
3 min read
Type of data in hadoop
shubham mishra
shubham mishra
shubham mishra
Follow
Jan 14 '23
Type of data in hadoop
#
bigdata
#
hadoop
#
datascience
2
reactions
Comments
Add Comment
2 min read
Observable and Newsletter. A comparison.
Ignazio Casamento
Ignazio Casamento
Ignazio Casamento
Follow
Jan 12 '23
Observable and Newsletter. A comparison.
#
data
#
bigdata
#
analytics
1
reaction
Comments
Add Comment
2 min read
Real Time Data Infra Stack
ChunTing Wu
ChunTing Wu
ChunTing Wu
Follow
Dec 5 '22
Real Time Data Infra Stack
#
eventdriven
#
architecture
#
tutorial
#
bigdata
4
reactions
Comments
Add Comment
6 min read
Example of applying CDC to JSON files with PySpark
romerito
romerito
romerito
Follow
Nov 30 '22
Example of applying CDC to JSON files with PySpark
#
cdc
#
spark
#
bigdata
#
deltalake
5
reactions
Comments
1
comment
7 min read
To study Apache Kafka Architecture in details, and how to install, deploy configure Apache kafka.
Ashwin Telmore
Ashwin Telmore
Ashwin Telmore
Follow
Nov 17 '22
To study Apache Kafka Architecture in details, and how to install, deploy configure Apache kafka.
#
bigdata
#
apache
#
kafka
#
manual
4
reactions
Comments
Add Comment
3 min read
loading...
We're a blogging-forward open source social network where we learn from one another
Log in
Create account