Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Everyone Building AI Research Tools Is Solving the Wrong Problem
Cover image for Everyone Building AI Research Tools Is Solving the Wrong Problem

Everyone Building AI Research Tools Is Solving the Wrong Problem

4
Comments
7 min read
Building a Local Code Search System with Ollama and AST-Aware RAG

Building a Local Code Search System with Ollama and AST-Aware RAG

1
Comments
4 min read
How We Use RAG for Knowledge Base Search in AutoBot

How We Use RAG for Knowledge Base Search in AutoBot

Comments
5 min read
80% of RAG Failures Start Here (And It's Not the LLM)
Cover image for 80% of RAG Failures Start Here (And It's Not the LLM)

80% of RAG Failures Start Here (And It's Not the LLM)

4
Comments
2 min read
ARKHEIN 0.1.0: The Great Decoupling
Cover image for ARKHEIN 0.1.0: The Great Decoupling

ARKHEIN 0.1.0: The Great Decoupling

Comments
3 min read
RAG Architecture in 2026: Building Smarter AI Applications

RAG Architecture in 2026: Building Smarter AI Applications

Comments
6 min read
Graph RAG does not need a graph database. It needs a database that does everything.
Cover image for Graph RAG does not need a graph database. It needs a database that does everything.

Graph RAG does not need a graph database. It needs a database that does everything.

10
Comments
10 min read
The “The Architecture Handbook for Milvus Vector Database” Book Review

The “The Architecture Handbook for Milvus Vector Database” Book Review

1
Comments
11 min read
Context Pruning Delivers Measurable ROI for Enterprise AI

Context Pruning Delivers Measurable ROI for Enterprise AI

Comments
1 min read
Context Pruning Unlocks Superior RAG Accuracy Metrics

Context Pruning Unlocks Superior RAG Accuracy Metrics

Comments
1 min read
How to Implement Semantic Pruning in Your RAG Stack

How to Implement Semantic Pruning in Your RAG Stack

Comments
1 min read
RAGの検索精度を3軸で測ったら最適解が条件で全く変わった

RAGの検索精度を3軸で測ったら最適解が条件で全く変わった

Comments
3 min read
Vector Databases for AI Agents: Which One Actually Works in Production?
Cover image for Vector Databases for AI Agents: Which One Actually Works in Production?

Vector Databases for AI Agents: Which One Actually Works in Production?

Comments
14 min read
Build Chatbot with RAG: Why Your Architecture Matters
Cover image for Build Chatbot with RAG: Why Your Architecture Matters

Build Chatbot with RAG: Why Your Architecture Matters

Comments
7 min read
Resilient Guest-Policy Retrieval: A Self-Healing Semantic Loop for Hotel Context
Cover image for Resilient Guest-Policy Retrieval: A Self-Healing Semantic Loop for Hotel Context

Resilient Guest-Policy Retrieval: A Self-Healing Semantic Loop for Hotel Context

Comments
19 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.