#bigdata
Introduction to Apache Hadoop & MapReduce
Shivansh Yadav · Jun 30 '24
#hadoop #dataengineering #bigdata #datascience
5 reactions · 3 min read
Databricks - Variant Type Analysis
Debashis Adak · Jun 29 '24
#databricks #spark #bigdata #datalake
4 reactions · 7 min read
Metadata for win — Apache Parquet
Rahul Dubey · May 25 '24
#python #bigdata #datascience #dataengineering
5 min read
Comprehensive Guide to Schema Inference with MongoDB Spark Connector in PySpark
Chetan Gupta · Jun 27 '24
#pyspark #bigdata #mongodb #spark
3 min read
Working with Parquet files in Java using Carpet
Jerónimo López · Jun 19 '24
#parquet #java #bigdata #dataengineering
1 reaction · 1 comment · 6 min read
Advanced Insights into Automated Data Processing Tools
Data Expertise · Jun 16 '24
#automateddataprocessing #machinelearning #bigdata #datascience
1 reaction · 4 min read
How to Build an API with Strong Security Measures
Ovais · Jun 12 '24
#api #bigdata #datascience #datamanagement
4 min read
Documenting Rate Limits and Throttling in REST APIs
Ovais · Jun 12 '24
#api #bigdata #datamanagement #datascience
5 min read
GraphQL API Design Best Practices for Efficient Data Management
Ovais · Jun 12 '24
#api #datamanagement #bigdata #graphql
2 reactions · 1 comment · 5 min read
The current Lakehouse is like a false proposition
Judy · Jun 12 '24
#lackhouse #bigdata #development #programming
6 reactions · 1 comment · 10 min read
Is distributed technology the panacea for big data processing?
Judy · Jun 6 '24
#bigdata #processing #development #lauguage
7 reactions · 1 comment · 10 min read
Big Data: the tool we need.
Delmiro Ribeiro · May 26 '24
#bigdata #database #datascience #backend
2 min read
PySpark: missing value
ChelseaLiu0822 · Apr 18 '24
#pyspark #python #dataengineering #bigdata
2 min read
Stream Data at scale from millions of sources with Amazon Kinesis (Serverless)
Asanka Boteju · May 20 '24
#bigdata #kinesis #aws #strems
13 reactions · 7 min read
The Role of Data Integration in Healthcare Research and Precision Medicine
Ovais · May 13 '24
#dataintegration #healthcare #datascience #bigdata
1 comment · 4 min read