Forem

# rag

Retrieval-augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

Cosine Similarity Failed Our RAG on Exact Terms — BM25 Fixed It

6 min read
RetailRAG-AI: AI-Powered Retail Intelligence

2 min read
Index-RAG: Citation-first approach to RAG

5 min read
When CLAUDE.md Stops Working: Adding Vector Memory to Claude Code

10 min read
AIGoat - AI Security Playground to Attack and Defend LLMs. All Running Locally

3 min read
How I Built a Hallucination Detector for RAG Pipelines in Python

3 min read
The architecture of persistent AI memory: Beyond simple vector search

2 min read
~1ms hybrid graph + vector queries (network is now the bottleneck)

3 min read
Compound AI Systems: How I Connect Multiple Models in a Single Production Product

2 min read
RAG Architecture: Building AI Apps That Know Your Data" platform

10 min read
Why Your LLM Ignores Detailed Instructions (It's Not a Bug)

2 min read
Most GenAI chatbot tutorials stop at “call an LLM get an answer.”

1 min read
The Next Frontier of AI Agent Runtimes: Observability, MCP, and High-Precision RAG

3 min read
RAG finds chunks. TrailGraph finds answers. Here's the difference.

7 min read
Beyond Vector Search: Building a Clause Forest (FoC) Architecture for Financial RAG

7 min read