#dataengineering
Posts
Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?
Apache Doris, Jul 8 '24, 9 min read, 1 reaction
#datascience #database #dataengineering #opensource
MapReduce Vs Tez
Shivansh Yadav, Jul 7 '24, 2 min read, 6 reactions
#hadoop #dataengineering #database #datascience
Azure Synapse Analytics Security: Data Protection
Ayush Kumar, Jul 4 '24, 6 min read, 3 reactions
#azure #sqlserver #dataengineering
Leveraging PySpark.Pandas for Efficient Data Pipelines
Felipe de Godoy, Jul 4 '24, 3 min read
#dataengineering #spark #pandas #python
Why Apache Doris is the Best Open Source Alternative to Rockset
Apache Doris, Jul 1 '24, 3 min read, 3 reactions
#database #bigdata #dataengineering #openai
Apache Spark-Structured Streaming :: Cab Aggregator Use-case
SNEHASISH DUTTA, Jun 30 '24, 4 min read, 1 reaction
#apachespark #dataengineering #streaming #realtimedata
Introduction to Apache Hadoop & MapReduce
Shivansh Yadav, Jun 30 '24, 3 min read, 5 reactions
#hadoop #dataengineering #bigdata #datascience
Analytics don't want duplicated data, so get it exactly-once with Flink/Kafka
kination, Jun 29 '24, 3 min read
#flink #kafka #dataengineering
Metadata for win — Apache Parquet
Rahul Dubey, May 25 '24, 5 min read
#python #bigdata #datascience #dataengineering
Remove unwanted partition data in Azure Synapse (SQL DW)
Ayush Kumar, Jun 24 '24, 6 min read, 1 reaction
#dataengineering #sqlserver #azure
Replacing Saas ETL with Python dlt: A painless experience for Yummy.eu
Aman Gupta, Jun 24 '24, 3 min read, 2 reactions
#saasetl #dataengineering #python
Simplifying SDMX Data Integration with Python
Aman Gupta, Jun 24 '24, 3 min read, 2 reactions
#dataengineering #pipeline #etl #sdmx
Unlocking the Power of Large Language Models (LLMs): Your Ultimate Guide
FuturisticGeeks, Jun 24 '24, 1 min read, 6 reactions
#webdev #llm #python #dataengineering
Clustering vs Partitioning your Apache Iceberg Tables
Alex Merced, Jun 21 '24, 8 min read, 7 reactions
#database #datascience #dataengineering #data
From Messy Data to Super Mario Pipeline: My First Adventure in Data Engineering
Jampa Matos, Jun 20 '24, 12 min read, 1 reaction
#dataengineering #python #automation #sql