Forem

Kreuzberg

The fastest document intelligence engine for RAG developers.

Organization Settings Admin

Kreuzberg is an open-source MIT-licensed polyglot document intelligence framework with a fast Rust core. We build tools that help developers extract, process, and understand documents at scale, in 56+ formats.

Beyond the Model: Why Document Intelligence Is the Next AI Infrastructure Layer
Cover image for Beyond the Model: Why Document Intelligence Is the Next AI Infrastructure Layer

Beyond the Model: Why Document Intelligence Is the Next AI Infrastructure Layer

Comments
4 min read
The Haystack converter that handles 91+ file formats without a Cloud API
Cover image for The Haystack converter that handles 91+ file formats without a Cloud API

The Haystack converter that handles 91+ file formats without a Cloud API

Comments
7 min read
Document Structure Extraction with Kreuzberg
Cover image for Document Structure Extraction with Kreuzberg

Document Structure Extraction with Kreuzberg

Comments
7 min read
BM25 + Vector Search in One Query: kreuzberg-surrealdb + SurrealDB v3
Cover image for BM25 + Vector Search in One Query: kreuzberg-surrealdb + SurrealDB v3

BM25 + Vector Search in One Query: kreuzberg-surrealdb + SurrealDB v3

4
Comments
8 min read
How to Extract Text from PDF in Python (2026)
Cover image for How to Extract Text from PDF in Python (2026)

How to Extract Text from PDF in Python (2026)

Comments
5 min read
Kreuzberg vs. Unstructured.io: Benchmarks and Architecture Comparison (March 2026)
Cover image for Kreuzberg vs. Unstructured.io: Benchmarks and Architecture Comparison (March 2026)

Kreuzberg vs. Unstructured.io: Benchmarks and Architecture Comparison (March 2026)

Comments
6 min read
Building a RAG pipeline with Kreuzberg and LangChain
Cover image for Building a RAG pipeline with Kreuzberg and LangChain

Building a RAG pipeline with Kreuzberg and LangChain

1
Comments
6 min read
Kreuzberg v4.3.0 and comparative benchmarks
Cover image for Kreuzberg v4.3.0 and comparative benchmarks

Kreuzberg v4.3.0 and comparative benchmarks

Comments
3 min read
loading...