Forem

# bigdata

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Quick use of CDC: A new demo from lakesoul makes it easier to set up the environment

Quick use of CDC: A new demo from lakesoul makes it easier to set up the environment

8
Comments
5 min read
Big Data in Cloud Computing - AWS
Cover image for Big Data in Cloud Computing - AWS

Big Data in Cloud Computing - AWS

14
Comments
2 min read
4 best opensource projects about big data you should try out

4 best opensource projects about big data you should try out

16
Comments 3
3 min read
A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions

A new unified streaming and batch table storage solution similar to iceberg/hudi/delta lake but with several new functions

8
Comments
2 min read
[OPINIÃO] Construindo uma Carreira como Data Engineer

[OPINIÃO] Construindo uma Carreira como Data Engineer

2
Comments
2 min read
How to prepare for the GCP Professional Data Engineer certification
Cover image for How to prepare for the GCP Professional Data Engineer certification

How to prepare for the GCP Professional Data Engineer certification

35
Comments 7
8 min read
Characteristics of Big Data

Characteristics of Big Data

4
Comments
8 min read
Apache Spark Unit Testing Strategies

Apache Spark Unit Testing Strategies

9
Comments
1 min read
NodeJS - Get data from Redash v6 API
Cover image for NodeJS - Get data from Redash v6 API

NodeJS - Get data from Redash v6 API

6
Comments
2 min read
Building an Apache ECharts dashboard with React and Cube
Cover image for Building an Apache ECharts dashboard with React and Cube

Building an Apache ECharts dashboard with React and Cube

14
Comments
11 min read
[DICA] Adentre o universo da Engenharia de Dados com profissionais brasileiros que se tornaram referência na área!
Cover image for [DICA] Adentre o universo da Engenharia de Dados com profissionais brasileiros que se tornaram referência na área!

[DICA] Adentre o universo da Engenharia de Dados com profissionais brasileiros que se tornaram referência na área!

6
Comments
2 min read
What are the best practices while using BigQuery?
Cover image for What are the best practices while using BigQuery?

What are the best practices while using BigQuery?

11
Comments
2 min read
Building a Bubble Dashboard with Cube
Cover image for Building a Bubble Dashboard with Cube

Building a Bubble Dashboard with Cube

9
Comments
14 min read
[ARTIGO] Data Warehouse, Data Lake e Data Lakehouse: Conceitos e Diferenças
Cover image for [ARTIGO] Data Warehouse, Data Lake e Data Lakehouse: Conceitos e Diferenças

[ARTIGO] Data Warehouse, Data Lake e Data Lakehouse: Conceitos e Diferenças

6
Comments
3 min read
Fast Multivalue Look-ups For Huge Data Sets
Cover image for Fast Multivalue Look-ups For Huge Data Sets

Fast Multivalue Look-ups For Huge Data Sets

6
Comments
6 min read
Dagster: The Best Free and Open-Source Alternative to Airflow With Python!

Dagster: The Best Free and Open-Source Alternative to Airflow With Python!

5
Comments
1 min read
What is the SingleStore and why should we use it?
Cover image for What is the SingleStore and why should we use it?

What is the SingleStore and why should we use it?

12
Comments 2
3 min read
How to handle nested JSON with Apache Spark

How to handle nested JSON with Apache Spark

3
Comments
3 min read
Machine Learning Lifecycle Process
Cover image for Machine Learning Lifecycle Process

Machine Learning Lifecycle Process

45
Comments
4 min read
Quill- Most efficient Scala driver for Apache Cassandra and Spark
Cover image for Quill- Most efficient Scala driver for Apache Cassandra and Spark

Quill- Most efficient Scala driver for Apache Cassandra and Spark

2
Comments
4 min read
Presenting ML-based COVID-19 Risk Assessment App Pandemonium
Cover image for Presenting ML-based COVID-19 Risk Assessment App Pandemonium

Presenting ML-based COVID-19 Risk Assessment App Pandemonium

4
Comments
3 min read
Cleaning And Normalizing Data Using AWS Glue DataBrew
Cover image for Cleaning And Normalizing Data Using AWS Glue DataBrew

Cleaning And Normalizing Data Using AWS Glue DataBrew

14
Comments 3
9 min read
Introduction to Apache Spark, SparkQL, and Spark MLib.
Cover image for Introduction to Apache Spark, SparkQL, and Spark MLib.

Introduction to Apache Spark, SparkQL, and Spark MLib.

12
Comments
15 min read
Data Lake explained
Cover image for Data Lake explained

Data Lake explained

6
Comments
4 min read
Introduction to Hive(A SQL layer above Hadoop)
Cover image for Introduction to Hive(A SQL layer above Hadoop)

Introduction to Hive(A SQL layer above Hadoop)

8
Comments
9 min read
loading...