Forem

# parquet

Posts

Why I’m Switching to Parquet for Data Storage

4
Comments
3 min read
From Python to ClickHouse: Parquet ETL with Go

3
Comments
2 min read
The Carpet feature that nobody will use

Comments
4 min read
Working with Parquet files

Comments
2 min read
Crawling web sites using “Data Prep Kit”

Comments
4 min read
Turning Parquet File into a Queryable RESTful with DuckDB, Quarkus & Kotlin

Comments
4 min read
The two versions of Parquet

2
Comments
5 min read
Compression algorithms in Parquet Java

3
Comments 2
7 min read
Working with Parquet files in Java using Carpet

1
Comments
6 min read
Working with Parquet files in Java using Protocol Buffers

Comments
7 min read
Working with Parquet files in Java using Avro

1
Comments
10 min read
GeoParquet 1.0.0 is Here, and It's Changing the Geospatial Game

2
Comments
4 min read
Push-Down-Predicates in Parquet and how to use them to reduce IOPS while reading from S3

1
Comments
8 min read
Beyond CSV files: using Apache Parquet columnar files with Dask to reduce storage and increase performance. Try it now!

7
Comments 4
5 min read
Snappy vs Zstd for Parquet in Pyarrow

12
Comments
3 min read
Converting CSV to ORC/Parquet fast without a cluster!

7
Comments
6 min read
Processing parquet files in Golang

19
Comments
4 min read
PySpark and Parquet - Analysis

14
Comments 1
3 min read